Knowledge Graph Objects in Google CSEs (True Semantic Search!)

booleanstrings Boolean Leave a Comment

 

As I was finishing the “Hacks” slides for my favorite conference, Sourcing Summit Europe, I stumbled across something I hadn’t seen before. Google Custom Search Engines (CSEs) got a new setting in the control panel:

We can now select Knowledge Graph Objects to restrict the search! I was intrigued; David Galley and I spent some time researching what the new option does.

We have been taking advantage of including Schema.org objects in CSEs since 2016. (See an example of the technology in this post, and we use it in Social List). The new setting is quite different.

While selecting Schema.org Objects rely on metadata placed there by web page creators, choosing a KG Object does not. Instead, it reflects Google’s “idea” of which KG Objects are relevant to the page. Therefore, the new option covers a much wider range of sites that can show up in the results. I say this is a true semantic search in a global search engine!

There is a just-announced API for selecting Knowledge Graph Objects, but there is not much explanation from Google how it works. (It could be quite complicated in the back-end, and is undoubtedly updated continuously.) The best way to examine what happens when we use the setting is to create CSEs and see what shows up in the results. (Get your hands dirty and your mind clear 😉).

We can restrict to Schema.org or KG Objects only in Custom Search Engines, but not on Google.com, and it’s time for Sourcers to get to apply the technology. It may sound quite technical, but it’s not; everyone should able to get KG Object searches going. Please come learn all about the new CSE capacity (and everything else about CSEs) at our webinar on October 15th, 2019.

Here are a couple of details on the KG selection, and four examples.

We can select up to five KG Objects, and the search will look for an OR of the five. However, we can search for an AND of two objects with the use of the CSE refinements.

Examples of CSEs that search for:

  1. Females –  example search: java developer san francisco
  2. Jobs – java developer san francisco
  3. Curriculum Vitae – java developer san francisco
  4. OSINTmaltego

Come join us at Become A Custom Search Engines Expert class coming up this week, for an in-depth look at all the options you can set in the CSEs. Seating is limited.

 

 

 

New: Google College Search

booleanstrings Boolean 4 Comments

While Google has posted a blog about college search, they did not tell us how to get to that advanced college search dialog, which looks like this for me:

You can search through tons of useful info – Program, Location, Average cost. Tuition, Type, State, Acceptance rate, Size, and Campus setting.

The secret to getting to this dialog is adding this piece – &ibp=htl;splinter – to your search URL. This was my search.

The new capability doesn’t work outside of the US yet (you will see a “not supported” dialog if you are outside of the US) and shows US schools only. But I expect it will be expanded globally. If you want to use the university dialog and are located outside of the US, run one of the IP address changers such as Tunnelbear.

For other Sourcing Hacks, please check the second edition of our eBook!

Talent Pipeline Decline?

booleanstrings Boolean 2 Comments

One of the few paid Sourcing tools I use is RPS (LinkedIn Recruiter, or LIR, for Agencies). It is our highest yearly expense, but we have been choosing to stay with it for a number of years now. My favorite feature of LIR has always been “Talent Pipeline” (the name doesn’t really fit), which is, in fact, the import function. As you import Excel files into LIR, the records are matched to LinkedIn profiles, and, as import finishes, you can search across the combined info. It’s a powerful Sourcing technique and has been my frequent go-to when Sourcing. The import function has also been quite reliable.

With the just-released new UI, we still have the import function, but LinkedIn has cut off its features to the point where its usefulness sounds like a question mark to me. Import appears to be a much weaker function in the new version.

Here is a brief summary of the changes.

There is no longer a way to match imported fields with LinkedIn’s.

The file to import (it seems) must be CSV-formatted with exactly these columns:

  1. First name
  2. Last name
  3. E-mail address
  4. Phone number

We used to be able to have one column with first/last, giving us some flexibility. But what is worse, I don’t see a way to import any additional values (that used to go to Notes – and we could then search by them!) Import crashes if a column is added to the import file.

(Before the changes, I was thinking that import would become even more powerful if we could match imported columns with custom fields. Forget about that now!)

And what’s worst of all – at least in my experiments, LIR can import only so many records at once. I was able to import a file with 119 records, but anything over ~140 failed with an error message.

We used to be able to import up to 5K records at a time. Is this a bug? Unless this is fixed (or we find a workaround 😉 ), the Recruiter import feature, a.k.a. Talent Pipeline will have much less value for Sourcers.

Let’s stay optimistic and hope that we will get the full functionality back.

Don’t miss our Productivity Tools for Sourcing webinar on Wednesday, September 11th! We’ll be talking about today’s best tools for all aspects of your Sourcing life. (Most tools covered are free).

How to Restore Image Search Functions

booleanstrings Boolean Leave a Comment

Quite unfortunately, Google has just removed three useful options in its Image Search:

  1. Option “Face” from “Types”,
  2. Option “Photos” from “Types”, and
  3. An ability to enter an exact image size.

Here is an example of what it used to look like until just a few days ago. This Image search looks for “Faces”, size 200×200 – with the idea to find profile pictures on LinkedIn:

… and here is what we see now (with no ability to enter an exact image size either):

(Why did Google do this?)

Here is how to compensate for the losses.

1.-2. Solution:

You will get both “Face” and “Photos” filter back by using Google Advanced Image Search URL.

3. Solution:

You will get searching by an exact size back by using the search operator imagesize: (did you know about it?)

(Note that when you press “Enter”, the operator will disappear, but the search will be filtered).

By the way, there is another Google Image Search operator with similar behavior (disappearing after “Enter” is pressed) – and that is filetype:, followed by one of the Image filetypes – JPG, GIF, PNG, BMP, SVG, WEBP, ICO, or RAW. Interestingly, after you have used the filetype: operator, you will get an extra menu for filetypes:

All three lost features can also be accessed by altering the search URL. Let’s memorize what those URL parameters spell out like for image types, sizes, and colors, and keep these strings, in case Google takes more options away from us in the menu.

  • Large images: &tbs=isz:l
  • Medium images: &tbs=isz:m
  • Icon sized images: &tba=isz:i
  • Image sized exactly 200×200: &tbs=isz:ex,iszw:200,iszh:200
  • Images in full color: &tbs=ic:color
  • Images in black and white: &tbs=ic:gray
  • Images that are red: &tbs=ic:specific,isc:red (orange, yellow, green, teal, blue, purple, pink, white, gray, black, brown)
  • Image type Face: &tbs=itp:face
  • Image type Photo: &tbs=itp:photo
  • Image type Clipart: &tbs=itp:clipart
  • Image type Line drawing: &tbs=itp:lineart
  • Image type Gif: &tbs=itp:animated
  • Show image sizes in search results: &tbs=imgo:1
  • Search for filetypes: &as_filetype=png (will get you a new filetype menu as when searching by the operator)
  • X-Ray: &as_sitesearch=linkedin.com (will instert a string site:linkedin.com into search)
  • Localize to country: &cr=countryNZ (a two-letter country abbreviation goes at the end)

So here, we have learned how to restore each piece of the disappeared functionality – and also about additional rather “hidden” filters.

Don’t miss the Productivity Tools for Sourcing webinar on Wednesday, September 11th!

How to Do Executive Job Title Research

booleanstrings Boolean Leave a Comment

Often, especially when sourcing for executives, we need to answer questions like these):

What are possible job titles at a particular level of seniority, in a given industry (or at a company), with given functions?

Equipped with this intelligence, we can start constructing filtered people searches. Without this research upfront, we would be encountering both false positives and false negatives when searching.

While doing open-ended searches (on LinkedIn, for example) and eyeballing results is useful, at times, we want to get lists of job titles that are as full as possible. For that, we can X-Ray some sites with profiles, such as Zoominfo, and scrape search results with a tool like Instant Data Scraper. In X-Rays, we’d include keywords for job titles we are looking for, such as chief, head, director, senior vice president, etc.

For example, we can search like this:

site:zoominfo.com/p intitle:accenture “* director”

and scrape results. We will need to clean up the collected data a little, but we can get a reasonably full list of job titles, that include the word director, at Accenture. Note that if we are able to get Google to highlight the exact job titles in the results (for example, by searching for “chief * officer”), we would get a clean output of the titles in a separate column with Instant Data Scraper.

The contact-finding site contactout.com has public profiles and we can X-Ray it in the same manner (then, scrape results):

site:contactout.com intitle:”credit suisse” director “united arab emirates”.

Yet another site, RocketReach, can be used for the same:

site:rocketreach.co intitle:walmart intitle:chief “chief * officer”.

As an example output, here are the titles of Chief Officers at Walmart found with the above string:

Chief Administrative Officer
Chief Business Development Officer
Chief Communications Officer
Chief Compliance officer
Chief Culture Diversity&Inclusion Officer
Chief Customer Officer
Chief Data Officer
Chief Ethics & Compliance Officer
Chief Ethics Compliance Officer
Chief Ethics Officer
CHIEF EXECUTIVE OFFICER
Chief Information Officer
Chief Information Security Officer
Chief Legal Compliance Officer
Chief Marketing Officer
Chief Merchandising Officer
Chief People Officer
Chief Procurement Officer
Chief Product Officer
Chief Revenue Officer
Chief Technical Officer
Chief Technology Officer

Sure enough, we can also X-Ray LinkedIn for the same purpose. Constructing searches is straightforward since public LinkedIn profiles have both job titles and company names in the page titles.

We can get a combined job title list from each of these types of X-Ray searches, and this would inform our people searches.

Join me for a brand-new webinar Executive Sourcing Techniques on Tuesday, September 17th to learn more!

17 Custom Search Engines

booleanstrings Boolean 3 Comments

Here are some Google Custom Search Engines for Sourcing I have created:

  1. Google No-Captchas(a.k.a. Search Like a Human)
    Public link: Search Like a Human– just search for anything without being tested for not being a bot
  2. Email Formats – use to discover email patterns for any corporation
  3. LinkedIn – Countries– X-Rays LinkedIn profiles; offers a dozen refinements by country
  4. Emails in Resumes– not only looks for resumes but also pushes email addresses in the resumes to be seen in snippets
  5. Document Finder– looks for documents that are stored in one of a dozen popular document storage sites, such as Slideshare
  6. File Types– looks for certain file types such as Excel and PDF. It is helpful if you are searching for lists or resumes
  7. Software Engineers in the Bay Area– exactly what it says (created by Julia)
  8. Hidden Resumes– triggers a resume search without any search operators. It is used on the site http://hiddenresumes.com
  9. Diversity Associations
  10. http://bit.ly/LinkedIn-XRay – just LinkedIn X-Ray
  11. http://bit.ly/LanguageSpeakers – X-Ray LinkedIn for language proficiency (search by a language name)
  12. http://bit.ly/developerresumes – Developer resumes
  13. http://bit.ly/GithubRepos – find Github users by programming languages
  14. http://bit.ly/Find-Accountants – find Accountants
  15. http://bit.ly/Find-Physicians – find Physicians
  16. http://bit.ly/hiddenprofiles – find social profiles (my most popular CSE!)
  17. http://bit.ly/findpersons – find people on the Internet

Learn about Custom Search Engines and other tools at our webinar on Wednesday, September 11th – Productivity Tools for Sourcing!

Also, learn about scraping tools in the recording Web Scraping For Recruiters and Custom Search Engines in Become A Custom Search Engines Expert.

Three LinkedIn Recruiter Sourcing Secrets

booleanstrings Boolean Leave a Comment

When we talk with our sourcing and recruiting clients and our students about paid tools they invest in, we almost invariably learn that all or most Recruiters on the team have LinkedIn Recruiter (LIR) subscriptions. Our recent Facebook group poll has confirmed the same. Yet the majority of LIR users are unaware of its thee important features, that I cover below. It’s no wonder – help documents barely cover the features or not at all.

If your team uses LinkedIn Recruiter, consider getting the just-updated lecture LinkedIn Recruiter and the Talent Pipeline and learn about these and other tips about the tool, that you won’t learn elsewhere.

  1. LIR advanced search is “not what it seems to be” – and not Boolean. With pretty much every search facet, there are interpretations of your input, that are mostly unintuitive and unanticipated (and are good to know about). I have written about some in my past blog posts (for example, LIR, unknowingly to users, brings in what “it thinks” are synonyms of a selected job title).
    I have recently run into mysterious interpretations for location searches as well, applicable both to searches by a zip code/radius and by an area name. Here is, simply, a search for members in San Francisco Bay Area – yet it shows thousands of people who live outside of the Bay Area:
  2. The Import, a.k.a. “Talent Pipeline” is an incredibly powerful function included in LIR, yet about 90% of LIR users still seem to have never heard of it. (The name is misleading, too). You can import Excel spreadsheets and combine the imported data with LinkedIn’s in your work – this opens up a variety of use cases. Additionally, you would save on InMails. I wrote about it back in 2015.
  3. You can potentially reduce the number of LIR seats for your team by assigning “Hiring Managers” roles to some team members while keeping the team productivity up. Also covered in 2015.

If your team uses LinkedIn Recruiter, consider getting the just-updated lecture LinkedIn Recruiter and the Talent Pipeline and learn about these and other tips about the tool, that you won’t learn elsewhere.

How to Scrape Github Profiles in One Minute

booleanstrings Boolean 7 Comments

Here is a delightfully easy way to get an Excel spreadsheet from a search for Github users.

(See an exclusive offer for Brainfood subscribers below).

  1. Install Chome Extension AutoPagerize. It will allow appending the 2nd, 3rd, etc. search results pages to the bottom of the current page, creating one long page that contains all the results (or as many as you wish). It works in Google search results as well as in Github search results.
  2. Search for the languages and locations on Github – for example, language:java location:amsterdam:
  3. Scroll down, letting AutoPagerize create a long page containing as many results as you wish.
  4. Install and run Chome Extension Instant Data Scraper from webrobots.io.

Voila – you can now export scraped, parsed results into Excel:

(Note that in this case there’s no need to locate the “Next” button since all the results are within the page.)

Enjoy!

Did you miss our webinar “Web Scraping For Recruiters”? This week’s presentation was sold out, and we are repeating it on Tuesday, August 6th, followed by an optional hands-on Workshop on August 7th. We will cover scraping tool selection and multiple tools such as Data Miner, Phantombuster, ZapInfo, Outwith Hub, and more. Seating is limited.

Non-Technical Scraping in Sourcing

booleanstrings Boolean 2 Comments

 

Web Scraping tools are rapidly gaining popularity among Sourcers and Recruiters. They are an essential part of a productive Sourcer’s toolbox.

Here is an example of how scraping may help your work. Suppose you see a webpage like this (listing some promising prospects):

If you wanted to examine and save the records, this format is quite inconvenient. Without tools, you’d be endlessly copying and pasting “by hand.”

But, take a scraper like Instant Data Scraper, and in seconds you will get an Excel file with parsed information:

Clearly, this format is easier to work with. You can search, sort, and filter columns, add info to records, upload to other systems, and use this data in combination with other data.

As an example, if you have a LinkedIn Recruiter account, you can upload the file using Talent Pipeline. Uploading will combine the scraped data with LinkedIn’s, and you can then search within the “enriched” LIR records.

While some scraping techniques require coding, for the majority of our tasks, we can get results with simple-to-use tools like Instant Data Scraper, Data Miner, and Phantombuster. Outwit Hub is a more sophisticated tool, but it can do simple things simply and “knows” how to extract contact information.

Join us for the first-ever class “Web Scraping for Recruiters” on Tuesday, July 30 at 9 am PDT, with an optional workshop on Wednesday, July 31st, to learn all about non-technical scraping for Sourcers and Recruiters, and start using this technique.

[Edited] Repeating the sold-out webinar on August 6th, 2019! Register here: https://sourcingcertification.com/webscraping/.

 

Google filetype: News

booleanstrings Boolean Leave a Comment

Google.com has quietly improved its filetype: searches. Previously, it was looking (simply) for the part of a page URL that ends in the filetype: argument, and it can still do so:

filetype:tonini (finds Facebook profiles in particular).

However, for standard, common file types, Google now also searches for all files of a given kind (such as MS Excel), with a variety of extensions. I.e. Google now searches according to the “true” file format (vs. just the string in a URL after the last period). For example, consider this search:

attendees filetype:xlsx -xlsx.

You will be finding MS Excel files with extensions other than XLSX, such as XLS, XLTX, etc. The same is true for other standard file types such as MS Word or PDF – Google will find other file extensions. That is quite helpful for our searches. (Note that Google’s help doesn’t tell us either about non-standard extensions like tonini or searching for true file types such as Excel).

In a hurry? You can use a shorter operator ext: on Google – it works the same way as filetype:.

Check out the fully updated 4th edition of my eBook Boolean Book, that has just “shipped”. The eBook includes X-Ray templates for Sourcing – including multiple filetype: searches – as well as several new Custom Search Engines and ideas on how to search productively.