Google has been automatically dropping terms from searches that give few or no results for some time. It now looks as though Bing may be doing the same. Unfortunately I cannot give the details of the search that brought this to light as it was confidential research. In general, though, what we were searching for were announcements or news articles about two companies involved in a particular project. We hadn’t found anything in Google so we tried various alternative search engines including Bing (http://www.bing.com/). The results seemed quite promising until we started looking at the individual pages. None of them had all of our terms. It is possible that the missing terms appeared in links to the pages but the content of the documents suggested that this was unlikely, and there is no reliable free tool that shows you who is linking to a specific page. So it looks as though Bing is now dropping terms in the same way that Google does.
There are two ways to stop Bing doing this. The first is to use the Boolean AND operator between all of your terms. The second is to prefix the term that must be present in a document with ‘inbody:’, for example inbody:aardvark.
Did we find anything that answered our question? No, but sometimes I don’t expect to and it is frustrating when the search engine thinks it knows best and unilaterally decides to rewrite the search strategy.
For a list of all of the Bing advanced search commands go to http://msdn.microsoft.com/en-us/library/ff795620.aspx
This is a feature which I have been seeing on and off for a few months so I’m not sure if it is one of their experiments or if it is being rolled out gradually. It’s very simple: advertisements that appear at the top of your results lists and in the panel to the right are marked with a little yellow box with ‘Ad’ written inside.
Over the years it has become harder to identify ads at the top of results as the pale pastel backgrounds to them became more subtle. It has been suggested that the more obvious marker is a consequence of discussions between Google and various regulatory authorities.
Several weeks ago I noticed that Google was displaying the terms it had dropped from your search as ‘Missing’. Google started routinely ignoring selected search terms towards the end of 2011 (see http://www.rba.co.uk/wordpress/2011/11/08/dear-google-stop-messing-with-my-search/). Google’s response to the outcry from searchers was to introduce the Verbatim search option. However, there was no way of checking whether all of your terms appeared in a result other than viewing the whole page. Irritating, to say the least, if you found that the top 10 results did not include all of your keywords.
Fast forward to December 2013, and some people started seeing results lists that showed missing keywords as strikethroughs. I saw them for a few days and then, just as I was preparing a blog posting on the feature, they disappeared! I assumed that they were one of Google’s live experiments never to be seen again but it seems they are back. Two people contacted me today to say that they are seeing strikethroughs on missing terms. I ran my test searches again and, yes, I’m seeing them as well.
I ran the original search that prompted my November 2011 article (parrots heron island Caversham UK) and included -site:rba.co.uk in the strategy to exclude my original blog postings. Sure enough, the first two results were missing parrots and had “Missing
parrots” underneath their entry in the list.
Remember, though, that Google automatically looks for variations on your search terms. Your original keyword may not be present in the results but a synonym may be, for example birds instead of parrots. I did find one entry in the top 10 results that seemed out of place. It was an article in the local newspaper. There was no ‘Missing….’ underneath the result but I could not find synonyms of parrots anywhere in the story. The closest was the surname of a person mentioned in the article – Parry – and I can only assume that this was Google truncating my ‘parrots’ keyword and looking for variations. But Parry was not highlighted in the entry in the results list as synonyms are. I shall look at the article in more detail to check that I have not missed a lurking parakeet or cockatoo!
I need to carry out more test searches to see how reliable this new feature is but, nevertheless, it is a welcome and useful addition to Google. If your top 10 results are showing strikethroughs then you know you have to either prefix the missing term with ‘intext:’ or use Verbatim for the whole search.
A couple of weeks ago I was in Exeter and Bristol leading workshops for NHS South West on “Google & Beyond”. We covered advanced Google commands, Google Scholar and alternatives to Google. Below are the combined top tips from the two sessions. I may have missed a couple from the list as I could not read my writing, so if you attended one of the workshops let me know if I’ve omitted your suggested tip.
- Verbatim Yet again, this has topped the list of useful Google search options. Google automatically looks for variations on your search terms and sometimes drops terms from your search without telling or asking you. To make Google run your search exactly as you have typed it in, first run your search. Then click on ‘Search tools’ in the menu above your results, in the second line of options that appears click on ‘All results’ and from the drop down menu select Verbatim.
- Be aware of personalisation. Even if you are not signed in to a Google account Google personalises your results according to your search and browsing behaviour. Personalisation is not necessarily a bad thing but if your want to burst out of the filter bubble, as it is often called, use a private browser window or incognito (Chrome). Google will then ignore tracking and search cookies on your machine and will not personalise your results. To call up a private browser or incognito window use the following keys:
Chrome – Ctrl+Shift+N
FireFox – Ctrl+Shift+P
Internet Explorer – Ctrl+Shift+P
- site: Use the site: command to focus your search on particular types of site, for example site:nhs.uk, or to search inside a large, rambling site. You can also use -site: to exclude sites from your search.
- intext: Google’s automatic synonym search can be helpful when looking for alternative terms, but if you want a term to be included in your search exactly as you have typed it then prefix the word with
- filetype: Use the filetype: command to limit your research to PowerPoint for presentations, spreadsheets for data and statistics or PDF for research papers and industry/government reports. Note that in Google filetype:ppt and filetype:xls will not pick up the newer .pptx and xlsx formats so you will need to include those in your strategy, for example filetype:ppt OR filetype:pptx, or run separate searches for each one. In Bing.com, though, filetype:pptx will pick up both .ppt and .pptx files.
- Advanced search commands and search options Learn how to use the search commands (for example
site:). Many of these can be used on the advanced search screen that can usually be found under the cog wheel in the upper right hand area of the screen, but that link sometimes disappears so learning the commands is a better bet. A list of the more useful Google commands is at http://www.rba.co.uk/search/SelectedGoogleCommands.shtml.
- Combine advanced search commands. Practise combining the advanced search commands for a more precise, focused set of results.
- Google Reading level. This changes the type of results that you see. Run your search and from the menu above the results select ‘Search tools’, ‘All results’ and from the drop menu ‘Reading level’. Options for switching between basic, intermediate and advanced reading levels should then appear just above the results. Click on the Advanced option to see results biased towards research. Google does not give much away as to how it calculates the reading level and it has nothing to do with the reading age that publishers assign to publications. It seems to involve an analysis of sentence structure, the length of sentences, the length of the document and whether scientific or industry specific terminology appears in the page.
- Numeric range. This command is unique to Google. Use it for anything to do with numbers – years, temperatures, weights, distances, prices etc. Simply type in your two numbers separated by two full stops as part of your search. This is a good way of limiting your search, for example, to forecasts over the few years.
- Limiting your search by date. To limit your search by date, for example the last month or year, first run your search. Then click on ‘Search tools’ in the menu above the results and from the second row of options that appears click on ‘Any time’. Select your time period or a custom range from the drop down menu.
- Use the minus sign to exclude documents containing a word. If you do not want documents containing a specific word prefix that word term with a minus sign. The minus sign can also be used with commands such as site: and filetype: to remove an individual site or type of document from your results.
- Million Short http://millionshort.com/. If you are fed up with seeing the same results from Google again and again give Million Short a try. Million Short runs your search and you can choose to remove the most popular web sites from the results. Originally, as its name suggests, it automatically removed the top 1 million but now you can choose to remove the most popular 100, 1000, 10k, 100k or million sites. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so “niche” that it never makes it into the top results in Google or Bing.
- Creative commons searches for images. Rather than search for images and go through them individually to find one that you can legally use in your document or presentation, use advanced search options or tools that allow you to select the appropriate license from the start. In Google, use the usage rights menu on the image advanced search screen to search for images with the license you need. The US version of Bing images includes a license option in the menu at the top of your results.
Double check the license of the photo on the website or blog hosting it. The license you need may be associated with a different image and yours could, for example, be ‘all rights reserved’.Flickr has a page where you can search for images with a specific Creative Commons license at http://www.flickr.com/creativecommons.
- Compare in Google. This is not a Google command but if you type in a search such as
compare carrots with cabbageGoogle will create a table comparing the properties of the two items. Google has been known to get some of the data wrong, though, so it’s worth double checking the figures before you use them.
- Web archives. Want to see what was on a website a few years ago or trying to track down a document that seems to have vanished from the web? Try the Internet Archive Wayback Machine at http://www.archive.org/. Enter the URL of the website or document and you should then see a calendar of the snapshots that the archive has of the site or document. Choose a date from the calendar to view the page. The archive does not have everything but it is worth a try. See also the UK National Archives of old government websites and pages at http://www.nationalarchives.gov.uk/webarchive/ and the UK Web Archive at http://www.webarchive.org.uk/ukwa/.
- Statistics sites. Although you can often find statistics via Google, you may find dedicated official statistics sites quicker and more reliable. Some of the sites we covered during the workshops were:
NHS Statistics Links http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics
UK National Statistics Publication Hub http://www.statistics.gov.uk/
Office for National Statistics http://www.ons.gov.uk/
Welsh Government Statistics http://wales.gov.uk/topics/statistics/
Welsh Assembly Government StatsWales http://statswales.wales.gov.uk/
UK Open data http://data.gov.uk/
European Union Open Data Portal http://open-dat.europa.eu/en/
Last week’s workshop on Google had a wonderful mix of participants from the academic, government, public, NHS and legal sectors. And true to form, Google decided to change a few things on the day. The link to the Google News advanced search completely disappeared (it wasn’t that good anyway!) but now seems to have reappeared. The search options that appear at the top of the results pages had changed compared with screen shots that I had taken 2 days previously; they options now seem to change according to the type of query so we suspect that this is an example of Hummingbird (Google’s new algorithm). The Google custom search engine interface has changed yet again and presented challenges to even those of us who are regular users. And then there was the new Google log-out/log-in interface which had us all flummoxed until the end of the day. (That merits a separate blog rant).
In the time honoured tradition, at the end of the day the group was asked to come up with their top 10 tips for searching Google. Here is what they came up with.
- Verbatim Several people shouted out this one as the number 1 tip for searching Google. Google automatically looks for variations on your search terms and sometimes drops terms from your search without telling or asking you. To make Google run your search exactly as you have typed it in, first run your search. Then click on ‘Search tools’ in the menu above your results, in the second line of options that appears click on ‘All results’ and from the drop down menu select Verbatim.
- site: command Use the site: command to focus your search on particular types of site, for example site:ac.uk, or to search inside a large rambling site. You can also use -site: to exclude sites from your search.
- Advanced search commands and search options Learn how to use the search commands (for example
site:), Many of these can be used on the advanced search screen that can usually be found under the cog wheel in the upper right hand area of the screen, but that link sometimes disappears so learning the commands is a better bet. A list of the more useful Google commands is at http://www.rba.co.uk/search/SelectedGoogleCommands.shtml
- intext: Google’s automatic synonym search can be helpful in looking for alternative terms but if you want a term to be included in your search exactly as you have typed it in then prefix the word with
- Country versions of Google. The country versions of Google give priority to the country’s local content, although it might be in the local language. This is a useful strategy when searching for industries, companies and people that are active in a particular country. Use Google followed by the standard ISO two letter country code, for example http://www.google.de/ for Google Germany, http://www.google.no/ for Google Norway.
- Google.com Apart from presenting your search results in a different order Google.com is where Google launches new features and search options first. As well as seeing pages that may not be highly ranked in Google.co.uk you will get an idea of how Google search may look in the UK version in the future. It also has some unique search options such as recipes!
- Google Scholar http://scholar.google.com/ Google Scholar collects all the versions of an article under an ‘All versions” link.
- Search by image Click on the camera icon in the image search bar to upload a photo or link to an image on the web. Google will then try and find similar images. There were comments from some of the workshop participants that this does not seem to always work as well as it used to, which reflects my own recent experience of the option. It is still worth a go, though, if you want to find different versions of an image.
- Hummingbird Keep an eye out for new layouts and ways of searching that are now appearing since Hummingbird was launched. For example, you can now compare the properties of two similar items: compare cabbage with spinach will show a table comparing the nutritional value of the two vegetables.
- Google Custom Search Engines (CSE) Several of the participants had a go at setting up their own CSE. Ideal for bringing together websites that you search individually on a regular basis.
Edited highlights of the presentations can be found on authorSTREAM at http://www.authorstream.com/Presentation/karenblakeman-1975041-make-google-behave-techniques-better-search-results/ and http://www.authorstream.com/Presentation/karenblakeman-1975022-google-scholar-citation-indexes/
Just over a year ago Bing launched a website called Bingiton (http://www.bingiton.com/), which enabled you to compare search results from Google and Bing side by side and then decide which set was best. You had to run five searches and then Bingiton told you which search engine you had chosen for each. After a couple of weeks the site was restricted to US users but it has now been relaunched in the UK.
The principle remains the same: you type in five searches, Bingiton displays the two sets of results side by side, and you decide which you prefer or go for the draw option (“can’t decide”).
I ran several batches of searches through Bingiton and Google won 4 of the rounds. The fifth, which consisted of searches for cake and jam recipes, was a draw with me being unable to decide. Two other rounds had to be declared null and void because “Scholarly articles” links (Google Scholar) appeared at the top of one set of results indicating that they were from Google. Another included what was obviously a Google map!
For me Bing seemed to be better at recipes and shopping enquiries than research oriented queries. Google consistently came out on top for local information and current news. Phil Bradley has also blogged about Bingiton (http://philbradley.typepad.com/phil_bradleys_weblog/2013/10/bings-bing-it-on-challenge-returns-to-the-uk.html) and invited people to comment on their own results. It is an interesting mix and Google does not always win or win outright. Take the Bingiton challenge yourself at http://www.bingiton.com/.
If you have a Google+ account you should have received an email from Google with the innocuous subject line “An Update for Google+ Page Owners and Managers”. Don’t ignore it because it contains important information about changes to Google’s terms and conditions that take effect on November 11th, 2013. From that date Google may start using your profile name and photo to endorse ads. Several people have already posted extensively about this so I will not go into much detail here. Tony Hirst has already written an excellent article on this at http://blog.ouseful.info/2013/10/12/googles-new-terms-mean-you-could-soon-been-acting-as-a-product-endorser/, which also addresses other privacy issues and account settings that you may wish to review and change.
If you do not want your photo and name used in advertisements you have to opt out because Google seems to have automatically added everyone to the scheme. Sign in to your Google+ account and go to the settings page. You should see a new option for “Shared endorsements” that is switched on. Click on the Edit link to change it to “Off”. Of course, Google tries to discourage you from doing this by saying that “your friends will be less likely to benefit from your recommendations”.
Click continue to carry on and switch endorsements off.
The September issue of Tales from the Terminal Room is now available at http://www.rba.co.uk/tfttr/archives/2013/sep2013.html
This month’s issue includes:
- Recent presentations
- Search tools – Google announces Hummingbird…..hmmmm
- Top tips from the latest business information workshop
- Interactive maps of UK renewable energy generation
- Twitter notes
- Forthcoming workshops and meetings How to Make Google behave: techniques for better results, Wednesday, 30th October 2013
- Free resources and search techniques for EU and UK legislation, Wednesday, 13th November 2013, London
- Anything but Google, Tuesday, 19th November 2013, London
The newsletter is a compilation of recent articles from this blog plus some extra goodies such as Twitter Notes.
I recently mentioned Gridwatch (How the UK’s electricity is generated http://www.rba.co.uk/wordpress/2013/08/21/how-the-uks-electricity-is-generated/) as a way of tracking how much energy is passing through the National Grid and the technology used to generate that electricity. Although Gridwatch is a great way of observing the total amount of electricity that is generated by each technology – gas, coal, wind etc – it does not go into any detail with respect to individual installations. The Digest of UK energy statistics (DUKES) produced by the UK Department of Energy & Climate Change (http://www.gov.uk/government/organisations/department-of-energy-climate-change/series/digest-of-uk-energy-statistics-dukes) includes a spreadsheet listing all of the operational power stations, fuel that they use, installed capacity, location, and the year that generation began. The direct link to the spreadsheet is http://www.gov.uk/government/uploads/system/uploads/attachment_data/file/226895/dukes5_11.xls. This is historical data and the current list refers to plants in operation at the end of May 2013.
UK Energy Watch has a map (http://www.ukenergywatch.org/Electricity/PowerStations) showing the location of UK power stations of 400 MW or larger, so it is by no means comprehensive. It does allow you, though, to click on a plant and display current generation except for CCGT stations (Combined Cycle Gas Turbine).
There are more options available when it comes to what are called renewables (for example wind, solar, biomass, hydro). The DECC’s RESTATS interactive map at http://restats.decc.gov.uk/app/pub/map/map/ enables you to search by technology, region, county, planning authority and application status. It also has a separate map for wind farm capacities.
The site information includes installed capacity, details of the planning application but not how much energy is actually being produced. RESTATS says that “Information is held on the performance of operational projects but owing to the need to maintain the commercially sensitive nature of these data, specific site details and performance figures are not disclosed“.
The UK Data explorer has produced a renewables map at http://ukdataexplorer.com/renewables/ that uses the RESTATS data and shows operational renewable electricity sites over 0.01 MW. The different colours represent the type of plant and the area of the circles indicate installed capacity (maximum power output).
To see details of a specific installation you should be able to hover over a point on the map. This did not work for me with some of the smaller plants and when I tried to zoom in on an area I often lost the background map.
The Interactive Map of Renewable and Alternative Energy Projects in the UK at http://www.renewables-map.co.uk is another interactive map and can be filtered by technology type and planning status.
According to the website the information is gathered from “a wide range of web resources, in all cases these will be referenced, usually by a link to that information. Locations are either taken from existing data, usually from planning applications, or by painstakingly identifying the location on the ground using online maps.” I am not sure how up to date the map is and I noticed that the smaller hydro installations along the Thames are missing. Another problem that I have experienced with this site is that when I click on “More details” for an installation I get far too many “internal sever errors”. However, when the information does appear it includes useful comments on the technology, links to relevant websites and the latest news.
The final one in my list is from the energy generating company RWE Innogy (http://www.rwe.com/web/cms/en/194914/rwe-innogy/sites/production-data-live/). Its interactive map provides information on most of its European plants and includes wind farms, hydro power plants and biomass CHP (Combined Heat and Power). The production data is updated every minute. To see information on an installation, click on its icon on the map. The information includes live production, location, type of installation and when production started.
These are by no means the only websites offering interactive maps and information on UK energy production, and none of them give the full picture. They are good starting points, though, if you are interested in researching individual technologies or individual power stations.