Please Say No to OR on Google

booleanstringsBoolean, Google

As I tweeted the other day, “I celebrate each time I talk a recruiter into stopping using long ORs – or any ORs – on Google. Stop – it is an outdated, ten-year-old technique.”

And I do not mean using | instead.

Here are some brief notes on the subject – and I hope to convince you, too!

1. Using OR defeats useful Google’s semantic interpretation of your search

What happens if you use the OR operator, for example, search for Developer OR Engineer? Google will find pages with “Developer” and pages with “Engineer.” It mixes them up and displays. But it will have a hard time deciding which should go first because you have asked about several different things, not specifying priorities. (Does it make sense?)

If instead, you search for (just) Developer, Google will find Developer, developing, development, Software Engineer, Coder, Programmer, and others who develop code and show the most relevant pages first.

On a tight search, with no ORs, you will get more results than with ORs. You can test it yourself.

Using ORs for synonyms and similar terms on Google is counter-productive. If you want to include either term in your search, it’s best to run two or more consecutive searches, where you will know what you are looking to find in each.

2. Using OR does not help to find many terms on one page

For example, if you search for Accenture OR Deloitte, Google will run queries with one, then another, and mix up the results pages. It won’t prioritize pages with both terms if that is what you wanted.

3. If you must use ORs, look into Custom Search Engines

In some cases, you will have a long series of search terms that are not synonyms – for example, school or company names. If you must search for a long Boolean OR of terms, Google will not help very much because its searches are limited to 32 words. This approach can help, though it’s a bit technical. But you may want just not to utilize Google for those.

I will be sharing this type of content, with many how-to examples, at our upcoming

Online Sourcing Learning Day, May 6th.

You must join us!

 

 

 

 

Invitation: Online Sourcing Learning Day – May 6th

booleanstringsBoolean, OSINT

Have you ever wanted to make a big jump in your sourcing skills in just a day? I would like to invite you to join David Galley, Guillaume Alexandre, Kim and Gordon Lokenberg, Balazs Paroczay, Marcel van der Meer, and me on May 6, 2020. We will present a unique six-hour online event with an in-depth, diverse, actionable content on Sourcing and Recruiting. So, don’t wait and head over to:

Online Sourcing Learning Day May 6, 2020 Registration

You’ll find the details on the site, but here are some points, briefly:

For those with a serious interest in sourcing: this is going to be a one-of-a-kind event.

Presenters are all experienced sourcing hands-on practitioners. We are also all trainers and speakers; most of us have met at Phil Tusing’s Sourcing Summits and have become good friends since. I have listened to each of our speakers and guarantee you inspiration and quality, up-to-date content, delivered from diverse angles.

Our topics cover “everything,” from running intelligent Google searches to recruiting unicorns to peeking inside a Sourcer’s mind to OSINT techniques.

We will be running a side-by-side virtual coffee bar and break room throughout the event. Get together with your favorite presenters and ask any questions you like!

Not up for listening six hours straight? Want to review some points later? All who sign up will get the slides and video-recordings for you to keep and use to practice what you have learned.

We have priced the event to be affordable. We expect participants from many countries, with the majority being from the US and Europe (hence, the times of day). Please help us to promote Learning Day by sharing the link on Social Media and with colleagues. Our hashtag is #OLSD. Your support is appreciated.

We will be able to accommodate teams on an individual basis. (Tip: a “team” does not have to come from one organization. Contact your colleagues and form a team.)

I hope to “see” you there! Sign up now.

 

 

 

Search for Women in Your Industry: 5 Tips and a Bonus

booleanstringsBoolean, Diversity

I live in a very diverse area (and love it). Let me demonstrate what it is like in the streets and parks, when people are out, by showing a screenshot for local Python Developers (no diversity filters applied):

However – search for CEOs in the same area and you will see a very different picture:

(A stunning difference, isn’t it?)

Different industries, locations, and career levels have their diversity employment challenges. And every company, at every level, benefits from hiring diversity for all roles.

Let’s talk about searching for Women, with the purpose of including them in the talent pipeline. Our webinar this week will serve as a Complete Guide to Sourcing Females. I have forty slides on that. 👩👩‍🦰👱‍♀️ In a short blog post, I will mention a few approaches that will complement your diversity efforts.

1. Searching in a Diverse Area? Use Women’s Names for Ethnicities.

It is highly applicable here in the San Francisco Bay Area, where Indian and Asian Software Engineers are a majority. I wasn’t finding many profiles on LinkedIn for my diversity project by searching for common (English) women’s names until this occurred to me! Strings I had started using then, like an OR of Indian women’s names – (Aditi OR(Bipasha) OR(Damayanti) OR(Eva) OR(Gayatri) OR(Harshita) OR(Indira) OR(Jesminder) OR(Kanika) OR(Lata) OR(Madhumita) OR(Nadia) OR(Padmavati) OR(Rajadhi) OR(Saloni) OR(Tanisha) OR(Uma) OR(Varsha) OR(Yamini)) (etc.) – brought the desired results.

2. Continuing on the fruitful “image for *” technology exploration, started by my friend @theBalazs, search for diverse colleges and organizations. For example:

or, just this (add your terms):

Note that you will find additional results with

Switch to the Image Tab to see how accurate your search is.

3. Have you noticed that diversity searches often require long lists of terms, such as relevant associations or schools, as part of the approach? Custom Search Engines can automate relevant searches for you.

Here is my X-Ray LinkedIn for Women’s Names.

4. Some sites allow us to X-Ray by gender:

We could also create one for CrunchBase, since member profiles have the gender info.

5. Every industry has associations oriented toward women, which you can find by Googling. For each site, you could potentially be constructing searches like:

and find profiles of females in that industry.

Bonus. Here are two additional Diversity CSEs:

Please join me at the fully reworked webinar “Sourcing for Diversity” on April 15th, 2020. Seating is limited.

Avoid These Nine Mistakes on Google

booleanstringsBoolean, Google

Google indexes 35 trillion web pages. (Compare the volume with LinkedIn’s. LinkedIn profiles are 0.001% of Google’s Index!)

However, mining Google is not straightforward because the web has different kinds of pages. We can search for terms in the page titles, URLs, or links to the page but usually not for values like job titles or companies. If you want a reminder, here is The Full List of Google Advanced Search Operators.

Compared to, say, ten years ago, Google’s search has changed dramatically, not only adding trillions of pages but learning to recognize user’s intent when they are searching. That is a definition of Semantic Search. Because of clever Google’s interpretation of search strings, we need to be Masters of Boolean – and know when to let Google control results as well.

From our experience training, we have consistently seen two kinds of attendees. Some have an open mind and try to “get it” and apply what they learn right away. We offer a month of support on all our offerings to stimulate that – and are celebrating with the students when they get to understanding and success. Others, especially self-identified “old school” recruiters, copy and keep search strings or use Boolean Builders, all of which have less than optimal templates, or write very long strings on Google. They do not try to understand why the results look like this or that on the screen.

However – everybody can learn, and many have! Boolean search is not Rocket Science. All you need is an open mind and a computer with a browser and wi-fi.

If you are Googling, do overcome these mistakes and unhelpful habits:

  1. There is no operator AND
  2. NOT needs to be written as the minus -, no space between the minus and keyword
  3. Parentheses are ignored. OR is always a priority (unlike it is on Bing or LinkedIn)
  4. Operators like site: must be lower-case, no space between the operator and keyword
  5. Do not trust or compare the numbers of results
  6. Put your terms in the order you expect to see them
  7. Do not try to “catch everything” with one search string. Strings are not “built”, they are run and immediately modified
  8. Do not be a perfectionist. Searches will have false positives and miss something. Your goal is not strings but results
  9. Overusing the operator OR leads to shrinking results – search simple

I cannot stress #9 enough. It is not a good idea to use ORs on Google. (I never do). I can give you numerous examples of how simple search works better than long ORs. It is time to change this habit.

Please join me at our fully refreshed webinar “Boolean Basics & Beyond” coming to your laptop at home on April 7, 2020. We will mostly cover Google but will talk about other sites as well. Seating is limited (due to our need to support everyone who signs up).

 

How to Correctly X-Ray LinkedIn for Headlines

booleanstringsBoolean, LinkedIn

It is quite unexpected – and 99.99% of LinkedIn members do not realize that – but in LinkedIn people search, Headlines are not taken into account! (Easy to check). The Headline is the main intro on your profile that you want the world to see but you cannot be found by it. OOPS, LinkedIn. The moral of the story is that you might want to check that your Headline wording is repeated in your Summary so that others will find you as you expect.

Google, of course, does not discriminate between parts of profile pages, and you can search for anything, Headlines included.

Better yet, Google Custom Search Engines (CSEs) provide us with a unique way to search specifically in Headlines! That compensates for LinkedIn’s neglect of its own design. Thanks to my friend and Master Sourcer Pierre-André Fortin for pointing it out. Somehow I’ve been missing it.

While there has been a negative shift in the relationship between Googlebot and CSEs, one (the only one as far as I can tell) way to narrow down to a field that currently works, Headline, is described below. (Note that searches in this post will only find profiles indexed more than six months ago due to the relationship going wrong around that time. CSEs “think” that current profiles do not even have a Person object. It is such a loss! I hope to see it fixed.)

The syntax for the Headline search is not pretty. But you will clearly see which part you need to copy (the operator) and which you can vary (the arguments). The Asterisk * means AND. This search –

more:p:metatags-og_title:javascript*python

finds profiles of members who have both words javascript and python in their Headlines (but obviously not in the job titles). The results are JavaScript and Python enthusiasts.

(To note, the link above is the best LinkedIn X-Ray CSE, whether you use operators or not.)

We have seen many cases in our sourcing practice where Headlines had uniquely qualifying information and brought up additional results, that couldn’t be found on LinkedIn.

Please join me at our class

Overcoming LinkedIn’s Limitations, Wednesday, April 1st, 2020

to get fully updated on search algorithms, workarounds, less-known functionality, and X-Raying of LinkedIn. As we all know, LinkedIn changes are vast, restricting, undocumented, and unannounced. Spend interesting and productive 90 minutes to get up-to-date, choose the right sourcing tracks, and feel confident. Slides and video are included, as well as 30 days of support from us. Note that seating is limited. Those who can’t attend at the scheduled time will get all the materials and support if they sign up.

 

 

 

 

 

more: Healthcare Sourcing Techniques

booleanstringsBoolean, Diversity

In sync with the times, I want to share some Healthcare sourcing tips to implement in addition to what you are already doing.

I know that sourcing for Healthcare professionals is challenging from experience. Last year, I ran a project sourcing for (bedside) Registered Nurses with 2-3 years of experience for a hospital in Texas. While LinkedIn is a site that can verify the length of experience, it produced a ridiculously small outcome: 200 RNs out of tens of thousands RNs who live in Texas.

However, the industry has its advantages in terms of finding info online, including:

  • There are searchable databases
  • Hospital websites have bios and contact lists
  • Hospital websites have “structure” for filtered search via Custom Search Engines

Sites, where you can search for doctors and nurses, include doximity.com, healthgrades.com, vitals.com, and zocdoc.com (Google these together and you will find more). Physicians’ profiles usually have degrees, licenses, specialization, education, affiliation, address, phone number, and ranking. Some profiles have more info, including email or gender. However, searching within the sites is limited.

Luckily, these sites have public profiles, which we can X-Ray, for example, like this:

site:doximity.com/pub gastroenterology intitle:”San Francisco”.

Examine profile titles and URLs for each site to see if it’s possible to search for specific values such as specialty with inurl: or intitle:, like I did in the example.

And we can do even better using Custom Search Engines (CSEs). I used to work with doctors back when I was a Software Developer and have found them to be “computer-unfriendly”. So I was surprised to discover that healthcare-related sites like the ones above, as well as websites of hospitals, are rich with structured Schema.org-based information, which we can query. (Hospitals may have strong IT staff or use some standard hospital website-building tools, I don’t know.)

Schema.org provides healthcare webmasters with many Objects such as Hospital, Physician, and MedicalOrganization. Now, if you are prepared to deal with gibberish-sounding “more:p:” search operators, you can take advantage of searching for Objects’ values. If you are not, sorry! Please read some other posts.

In the following examples, I will be using the CSE Search Everything, which has no restrictions. (The reason to use a CSE vs. Google.com is that only CSEs support operators to query pages’ structure.) You can modify each of the links below to serve your needs.

Physician search across sites:

more:p:physician plastic surgeon cleveland.

If you start seeing many results from one particular site in searches like the above (which is likely to happen), you may want to X-Ray that site alone and also search excluding the site.

Search for Physicians by specialty (add your keywords):

more:p:physician-medicalspecialty:dermatology.

Search for keywords in description:

more:p:physician-description:ear*nose*throat

(The Asterisk * serves as an AND operator).

Search for hospitals by name:

more:p:hospital-name:saint*mary

Whoever worked on creating Schema Objects had a weakness for all things medical:

There is even an Object for Medicalwebpage. There is lots to explore.

Among general search sites, Doximity provides especially rich structure to query:

Our tool Social List has Agents for Doximity and Healthgrades, allowing you to search without writing out operators and export results in Excel.

If you have any questions, please let me know.

 

 

 

 

 

 

LinkedIn Activity Called Working from Home

booleanstringsBoolean, LinkedIn

I have just discovered that X-Raying LinkedIn on Google in the following manner:

site:linkedin.com/in “image for * * activity called <keywords>”

– finds profiles that have shared a status on LinkedIn that includes <keywords>. Combined with professional terms such as skills and job titles, the search can point you to people who are more open to new opportunities, as an example. We are sort-of getting a flavor of structured search, similar to this example, since the <keywords> after “activity called” would be searched for in member’s shares only. (I’ll work on saying the above more clearly if you feel you need it.)

Here are some examples (to be combined with other parameters):

These X-Ray searches can provide value, both as a sourcing tool, looking for individuals, but also as a research tool, looking, for example, where work from home is popular or who is hiring and firing.

If you have other ideas on what to include along with the “activity called,” please share!

 

Hack: Search for Female Names with This CSE

booleanstringsBoolean, Diversity, Hack

Are you searching for diversity? Using the same “synonyms” trick as described in Hack: Use 500 Keywords, Not 32, on Google, I have created a Custom Search Engine that looks for the 50 most popular women’s names. Add location, title, or skills, and you will get a list of women’s LinkedIn profiles with the required qualities.

Example search: python developer bay area. Here is what it looks like:

The results may have false positives, but the majority will be on target.

You can create other diversity CSEs using synonyms, for example, a list of Latino last names, or African-American colleges. Let me know if you want me to make one for you.

Please join us for the updated webinar “Sourcing Hacks” on March 18th, 2020. We will share some new hacks (and some old ones are gone).

Google CSEs Go Wild with LinkedIn URLs

booleanstringsBoolean, Google, LinkedIn

This post will be of interest to those who use or create Custom Search Engines, in particular, to X-Ray LinkedIn for profiles (and avoid high LinkedIn subscription fees). The post is somewhat technical except for the first few paragraphs. We have a webinar on CSEs if you want to dig deeper into CSEs.

You might have noticed that the LinkedIn X-Ray Custom Search Engine you may have bookmarked, that used to work, will now return few or no results on seemingly open-ended queries.

Here is a new CSE to X-Ray LinkedIn that fixes that issue – I recommend using this one:

http://bit.ly/BetterLIXray

Now, for the technical part. For those who use advanced CSE search operators and create CSEs, here is what has happened over the past few months. We are seeing more than one unexplained behavior; it applies to LinkedIn URLs only. Nobody seems to have any clue why this is happening, so this information below is more like a weather report, not a “how” and “why” post. (We had to work very hard to “fix” the LinkedIn Agent in Social List).

  1. CSEs no longer “see” the Person object in public LinkedIn profiles, and neither do CSE APIs. Profiles indexed more than a few months ago still have “Person” in Google’s cache and can be queried, but newer profiles don’t have it (and do not have “Hcard” either). Because of that, searching with more: operators will find only older, indexed a while ago, profiles. Creating a Person-based CSE, where you specify the object in the Control Panel, will also limit the results to old.
    (There is info on “Person” – for which we would love to search – when you examine the source code for public profiles, but somehow Google fails to see it).
  2. If you include both “linkedin.com/in” and “linkedin.com/pub” as sites to X-Ray in your CSE, it will be producing very few results, with the keywords mostly in page titles.
  3. If you include “site:linkedin.com/in OR site:linkedin.com/pub -pub.dir” as a string to add to each query and make the CSE search everywhere, it will work for a CSE definition. That is how I built http://bit.ly/BetterLIXray. However, if you make an API call for that CSE, it will be finding pages on LinkedIn, but all sorts of pages, like jobs – not just profiles. Odd.
  4. You can find profiles in particular countries by including URL templates, with the use of the asterisk (which you shouldn’t have to do), like “uk.linkedin.com.in/*”. You can make refinements for a list of countries in one CSE, but make sure you append the asterisks to the templates. Here is the current list of all countries if you need it: LinkedIn Countries.
  5. When searching, avoid using ORs, search several times instead – otherwise, you will be getting fewer results.

Social List (which is based on CSE APIs) is up and running! We have found a way around all of the weirdness.

It’s hard to say what to expect in the future months, but let’s hope that mutual understanding between CSEs and LinkedIn will be restored – and improved. It would be nice to search not only for schools, companies, and headlines but also for locations.

Cheers!

Competitive Intelligence from Open Datasets

booleanstringsBoolean, Diversity

Do you know about Open (public) Datasets? Google got its Dataset Search out of Beta, and you should bookmark it.

Lee Candiotti has asked this question on our group

Great question.

Let’s start with ways to locate datasets, in addition to Google’s tool. You can simply Google to locate datasets. And once you start looking, it’s endless! There’s Google-owned Kaggle.com. There are lists of datasets. There are gigantic lists of datasets and lists of favorites, and suggestions on Ycombinator.

There is an alternative way to find datasets, and that is in a Custom Search Engine, using a schema.org Object “Dataset”. I have created one, try it.

What can we expect to find in datasets? Industry, employment, and demographic data, but also, sometimes, contact lists and organizational charts. Datasets can be awesome for Competitive Intelligence.

Here are some nice finds.

I got lucky having found a downloadable list of 60K+ CEO and VP email contacts. (You are welcome! Please use it with care.)

There is a downloadable 7 MLN  US companies dataset at Peopledatalabs. It includes Company Name, Domain, Linkedin, # of Employees, Year Founded, Industry, and Location (city, state, country).

Here is a list of 5K job titles and relevant skills, which might help if you are searching in an unfamiliar industry.

Some dataset sites have visualizations, like this – Diversity in Tech. Many offer Dynamic Insights in interactive dialogs.

Come to our new webinar on CI to learn more! Today’s class was sold out but we have scheduled a repeat. Don’t miss it!