Tag Archives: top tips

Essential Non-Google Search Tools for Researchers – Top Tips

This is the list of Top Tips that delegates attending the UKeiG workshop on 7th September 2016 in London came up with at the end of the training day.  Some of the usual suspects such as the ‘site:’ command, Carrot Search and Offstats are present but it is good to see Yandex included in the list for the first time.

  1. Carrotsearch http://search.carrotsearch.com/carrot2-webapp/search or http://carrotsearch.com/ and click on the “Live Demo” link on the left hand side of the page.
    This was recommended for its clustering of results and also the visualisations of terms and concepts via the circles and “foam tree”. The Web Search uses eTools.ch for the general searches and there is also a PubMed option.

    Carrot Search Foam PubMed Foam Tree
    Carrot Search Foam PubMed Foam Tree
  1. Advanced Twitter Search http://twitter.com/search-advanced
    The best way to search Twitter! Use the Advanced Search http://twitter.com/search-advanced or the click on the “More Options” on the results page. There is a detailed description of the commands and how they can be used at https://blog.bufferapp.com/twitter-advanced-search 
  1. Yandex http://www.yandex.com/
    The international version of the Russian search engine with a collection of advanced commands – including a proximity operator – that makes it a worthy competitor to Google. Run your search and on the results page click on the two line next to search box.

    Yandex Advanced Search
    Yandex Advanced Search

    Alternatively, use the search operators. Most of them are listed at https://yandex.com/support/search/how-to-search/search-operators.xml. There is also a /n operator that enables you to specify that words/phrases must appear within a certain distance of each other, for example:

    "University of Birmingham" nanotechnology /2 2020

    There are country versions of Yandex for Russia, Ukraine, Belarus, Kazakhstan and Turkey. You will, though, need to know the languages to get the best out of them and apart from Turkey they use a different alphabet.

  1. Millionshort http://millionshort.com/
    If you are fed up with seeing the same results from Google again and again give MillionShort a try. MillionShort enables you to remove the most popular web sites from the results. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so specialised that it never makes it into the top results in Google or Bing.Originally, as its name suggests, it removed the top 1 million but you can change the number that you want omitted. There are filters to the left of the results enabling you to remove or restrict your results to ecommerce sites, sites with or without advertising, live chat sites and location. The sites that have been excluded are listed to the right of the results.
  1. site: command
    Use the site: command to focus your search on particular types of site, for example include site:ac.uk in your search for UK academic websites. Or use it to search inside large rambling sites with useless navigation, for example site:www.gov.uk. You can also use -site: to exclude individual sites or a type of site from your search. All of the major web search engines support the command.
  1. Microsoft Academic Search http://academic.research.microsoft.com/
    An alternative to Google Scholar.“Semantic search provides you with highly relevant search results from continually refreshed and extensive academic content from over 80 million publications.”This was recently revamped and although it now loads and searches faster than it used to the new version has lost the citation and co-author maps that were so useful. It can be a useful way of identifying researchers, publications and citations but do not rely on the information too much. It can get things very wrong indeed. For example, I’ve found that for some reason the affiliation of several authors from the Slovak Technical University in Bratislava is given as the Technical University of Kenya!
  1. Wolfram Alpha https://www.wolframalpha.com/
    This is very different from the typical search engine in that it uses its own curated data. Whether or not you get an answer from it depends on the type of question and how you ask the question. The information is pulled from its own databases and for many results it is almost impossible to identify the original source, although it does provide a possible list of resources. If you want to see what WolframAlpha can do try out the examples and categories that are listed on its home page.
  1. OFFSTATS – The University of Auckland Library http://www.offstats.auckland.ac.nz/
    This is a great starting point for locating official statistical sources by country, region or subject. All of the content in the database is assessed by humans for quality and authority, and is freely available.
  1. Meltwater IceRocket http://www.icerocket.com/
    IceRocket specialises in real-time search and was recommended for inclusion in the Top Tips for its blog search and advanced search options. There is also a Trends tool that shows you the frequency with which terms are mentioned in blogs over time and which enables you to compare several terms on the same graph.

    IceRocket Trends
    IceRocket Trends

    Very useful for comparing, for example, mentions of products, companies, people in blogs.

  1. Behind the Headlines NHS Choices http://www.nhs.uk/news/Pages/NewsIndex.aspx
    Behind the headlines provides an unbiased and evidence-based analysis of health stories that make the news. It is a good source of information for confirming or debunking the health/medical claims made by general news reporting services, including the BBC. For each “headline” it summarises in plain English the story, where it came from and who did the research, what kind of research it was, results, researcher’s interpretation, conclusions and whether the headline’s claims are justified.

Advanced Google workshop – Top Tips

This collection of Top Tips is a combined list nominated by those who attended the UKeiG workshop on “New Google, New Challenges”. The next UKeiG Google workshop will be run on 8th September 2016.

1. Do not trust Google’s facts and answers
Google tries to provide facts and quick answers to your queries at the top and to the right of your results. These are computer generated extracts from pages and several different sources may be used to produce an “answer”. They are sometimes misleading or completely wrong. At the time of writing, the answer provided for a search on frugivore is an excellent example. (It explains why your cat is so fussy over its food – it is obviously craving its 5 a Day!) Always go to the original source to double check the information, but this is not always provided by Google.

2. Country versions of Google and /ncr
Country versions of Google give priority to the local content. This is a useful strategy when searching for research groups, companies and people that are active or working in a particular country. Use the standard ISO two letter country code, for example http://www.google.fr/ for Google France, http://www.google.it/ for Google Italy.

It is also worth trying your search in Google.com. Your results will probably be more international or US focused but you may see new search features or layouts in Google.com that are not yet available elsewhere. If Google insists on redirecting you to your own country version, go to the bottom right hand corner of the Google home page and you should see a link to Google.com. If there is no link then add ‘/ncr’ to the Google URL, for example http://www.google.com/ncr .

The downside of using country versions of any search tool is that the prioritised information is likely to be in the local language.

3. Search history
Your search history, which is recorded and available for you to view if you are signed in to your Google account, is used by Google to help personalise your results but it can also be useful as a record of past searches. If a user comes back to you having forgotten or lost the search and documents you gave them your search history should be able to help you find both. On any search results page click on the cog wheel in the upper right hand area of the screen and select History. You can then browse your history or select a date from the calendar (upper right and area of the History screen).

4. Verbatim
This is an essential tool for making Google carry out your search the way you want it run. Google automatically looks for variations on your terms and sometimes drops terms from your search, which is not always helpful. To use Verbatim, first run your search. Then click on ‘Search tools’ in the menu that runs across the top of your results page. A second row of options should appear. Click on ‘All results’ and from the drop down menu select Verbatim. Google will then search for your terms without any variations or omissions. Note that Google will search for documents and pages in which the words appear in any order. If you are searching on the title of a paper place the title within double quote marks to force an exact phrase match. If Google still alters your search then run Verbatim. 

Verbatim-Factsheet
If you are carrying out in-depth research it is worth trying out Verbatim even if the “normal” Google results seem OK. You may see very different and possibly more relevant content.

5. filetype: command.
An important advanced search command that is available not only in Google but in many alternative search tools. Use the filetype: command to limit your research to PowerPoint for presentations, spreadsheets for data and statistics, or PDF for research papers and industry/government reports.

For example:

plasmonic nanoparticles filetype:ppt

The command must be all lower case and there must be no spaces between the colon and the command or the file extension, otherwise Google will treat the command as a searchable word. Also you must search for pre and post Office 2007 file extensions separately as Google does not automatically pick up both.

For example

plasmonic nanoparticles filetype:ppt OR filetype:pptx

Note that Google’s Advanced Search screen pull down menu for filetype: only searches for pre Office 2007 extensions.

6. Minus sign to exclude information
Use the minus sign immediately before a term to exclude documents containing that term, but use with care as you may lose valuable information. It can also be used with commands to exclude file formats or websites from your search.

For example:

occupational asthma UK site:gov.uk -site:hse.gov.uk
-site:nationalarchives.gov.uk

7. Combine search commands
Combine multiple commands such as filetype: and site: to focus your search. Use the OR command to search for alternatives, for example:

occupational asthma UK site:ac.uk filetype:ppt OR filetype:pptx

8.Personalise Google News
Personalise Google News (http://news.google.co.uk) page when signed in to your account  and change what content is automatically displayed or add your own searches. Click on the Personalise button at the top of the right hand column. 

9. Google Scholar Cite feature
Click on the Cite link under a reference in Google Scholar and Google will give you options to import a citation in MLA, APA, Chicago, Harvard or Vancouver style into BibTex, EndNote, RefMan or RefWorks. Note that if the article is only available online you may need to add a doi or a URL, and the date of access.

10. Use Google site: search on Google scholar
This is one I had not thought of but was recommended by one of the delegates as a way of using Google’s advanced search commands on Google Scholar instead of Scholar’s own. (I have not had time to test this one out myself).

Business information key resources and search strategies – Top 10

The participants  of the business information workshop I ran on March 8th  had a variety of interests: search strategies and commands for Google et al,  UK government information, statistics, open data, social media, companies, locating scientific research.  So it was quite tough limiting the Top Tips that I asked them to nominate at the end of the day to just 10.

This is what they came up with.

  1. Get to know the key resources and starting points for different types of business information e.g. Companies House, OFFSTATS and go direct to those rather than Google. It will save you time in the long run.
  2.  Verbatim. An invaluable tool for research when Google insists on rewriting your search and dropping terms. To make Google search for all of your terms without variation, but in any order, first run your search. Then click on ‘Search tools’ in the line of options above your results. In the second line of options that appears click on ‘All results’ and from the drop down menu select Verbatim. If you are carrying out in-depth research it is worth using Verbatim even if your “normal” Google results seem to be OK. You may see very different content in the Verbatim list.
  3. Combine advanced search commands such as site: and filteype: to focus your search on types of information (PDF reports, PPT presentations, spreadsheets containing data) and websites (government, academic, individual sites). Also try using the minus sign to exclude documents containing specific terms or sites that are irrelevant.
  4. Phil Bradley’s UK Newspapers Google Custom Search Engine. http://www.philb.com/nationaluknewspapers.htmlPhilB_News_Search
    A relatively new tool that enables you to search all of the major national UK newspapers and regional newspapers. A real time saver if you are searching for local information on a local business or entrepreneur and don’t want to have to track down all the local papers and search them one by one.
  5. OFFSTATS – The University of Auckland Library http://www.offstats.auckland.ac.nz/ A good starting point for official statistical sources by country, region subject or combination of categories. All of the content in the database has been chosen and quality assessed by staff at The University of Auckland Library.
  6. Zanran http://zanran.com/ A tool for searching information contained in charts, graphs and tables of data. Enter your search terms and optionally limit your search by date and/or format type. Zanran comes up with a list of documents that match your criteria with thumbnails to the left of each entry. Hover over the thumbnail to see a preview of the page containing your data and further information on the document.
  7. Advanced Twitter Search. http://twitter.com/search-advanced Essential tool if you are using Twitter to look for news on product developments, announcements, conferences, discussions on technologies/companies, or how companies interact with customers.
  8. Wayback Machine http://www.archive.org/ Want to see what was on a website a few years ago or trying to track down a document that seems to have vanished from the web? Try the Internet Archive Wayback Machine at http://www.archive.org/. Enter the URL of the website or document and you should then see a calendar of the snapshots that the archive has. Choose a date from the calendar to view the page. The archive does not have everything but it is worth a try. See also the UK National Archives of old government websites and pages at http://www.nationalarchives.gov.uk/webarchive/
  9. OUsefulInfo, http://blog.ouseful.info/ “Trying to find useful things to do with emerging technologies in open education and data journalism”. Maintained by Tony Hirst, this blog has useful information and descriptions of what can be involved when dealing with and manipulating open data.
  10. DuckDuckGo  http://duckduckgo.com/ This was not covered in the workshop but one of the participants recommended it as a useful alternative to Google. Aside from the absence of tracking and personalisation it provides different and a greater variety of results when compared with Google.

Edited highlights of the workshop slides can be found on authorSTREAM and Slideshare.

My next business information related workshop is Discover Open Data on  the 7th April.  The  next advanced Google workshop (New Google, New Challenges) is on the 13th April and the Essential non-Google search tools is on  the 12th April.

Top search tips from Exeter and Bristol

A couple of weeks ago I was in Exeter and Bristol leading workshops for NHS South West on “Google & Beyond”. We covered advanced Google commands, Google Scholar and alternatives to Google. Below are the combined top tips from the two sessions. I may have missed a couple from the list as I could not read my writing, so if you attended one of the workshops let me know if I’ve omitted your suggested tip.

  1. Verbatim Yet again, this has topped the list of useful Google search options. Google automatically looks for variations on your search terms and sometimes drops terms from your search without telling or asking you. To make Google run your search exactly as you have typed it in, first run your search. Then click on ‘Search tools’ in the menu above your results, in the second line of options that appears click on ‘All results’ and from the drop down menu select Verbatim.
  2. Be aware of personalisation. Even if you are not signed in to a Google account Google personalises your results according to your search and browsing behaviour. Personalisation is not necessarily a bad thing but if your want to burst out of the filter bubble, as it is often called, use a private browser window or incognito (Chrome). Google will then ignore tracking and search cookies on your machine and will not personalise your results. To call up a private browser or incognito window use the following keys:

Chrome –  Ctrl+Shift+N
FireFox – Ctrl+Shift+P
Internet Explorer – Ctrl+Shift+P

  1. site: Use the site: command to focus your search on particular types of site, for example site:nhs.uk, or to search inside a large, rambling site. You can also use -site: to exclude sites from your search.
  2. intext: Google’s automatic synonym search can be helpful when looking for alternative terms, but if you want a term to be included in your search exactly as you have typed it then prefix the word with intext:.
  3. filetype: Use the filetype: command to limit your research to PowerPoint for presentations, spreadsheets for data and statistics or PDF for research papers and industry/government reports. Note that in Google filetype:ppt and filetype:xls will not pick up the newer .pptx and xlsx formats so you will need to include those in your strategy, for example filetype:ppt OR filetype:pptx, or run separate searches for each one. In Bing.com, though, filetype:pptx will pick up both .ppt and .pptx files.
  4. Advanced search commands and search options Learn how to use the search commands (for example intext:, filetype: and site:). Many of these can be used on the advanced search screen that can usually be found under the cog wheel in the  upper right hand area of the screen, but that link sometimes disappears so learning the commands is a better bet. A list of the more useful Google commands is at http://www.rba.co.uk/search/SelectedGoogleCommands.shtml.
  5. Combine advanced search commands. Practise combining the advanced search commands for a more precise, focused set of results.
  6. Google Reading level. This changes the type of results that you see. Run your search and from the menu above the results select ‘Search tools’, ‘All results’ and from the drop menu ‘Reading level’. Options for switching between basic, intermediate and advanced reading levels should then appear just above the results. Click on the Advanced option to see results biased towards research. Google does not give much away as to how it calculates the reading level and it has nothing to do with the reading age that publishers assign to publications. It seems to involve an analysis of sentence structure, the length of sentences, the length of the document and whether scientific or industry specific terminology appears in the page.
  7. Numeric range. This command is unique to Google. Use it for anything to do with numbers – years, temperatures, weights, distances, prices etc. Simply type in your two numbers separated by two full stops as part of your search. This is a good way of limiting your search, for example, to forecasts over the few years.
  8. Limiting your search by date. To limit your search by date, for example the last month or year, first run your search. Then click on ‘Search tools’ in the menu above the results and from the second row of options that appears click on ‘Any time’. Select your time period or a custom range from the drop down menu.Google date
  9. Use the minus sign to exclude documents containing a word. If you do not want documents containing a specific word prefix that word term with a minus sign. The minus sign can also be used with commands such as site: and filetype: to remove an individual site or type of document from your results.
  10. Million Short http://millionshort.com/. If you are fed up with seeing the same results from Google again and again give Million Short a try. Million Short runs your search and you can choose to remove the most popular web sites from the results. Originally, as its name suggests, it automatically removed the top 1 million but now you can choose to remove the most popular 100, 1000, 10k, 100k or million sites. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so “niche” that it never makes it into the top results in Google or Bing.
  11. Creative commons searches for images. Rather than search for images and go through them individually to find one that you can legally use in your document or presentation, use advanced search options or tools that allow you to select the appropriate license from the start. In Google, use the usage rights menu on the image advanced search screen to search for images with the license you need. The US version of Bing images includes a license option in the menu at the top of your results.

Bing Image License option
Double check the license of the photo on the website or blog hosting it. The license you need may be associated with a different image and yours could, for example, be ‘all rights reserved’.Flickr has a page where you can search for images with a specific Creative Commons license at http://www.flickr.com/creativecommons.

  1. Compare in Google. This is not a Google command but if you type in a search such as compare carrots with cabbage Google will create a table comparing the properties of the two items. Google has been known to get some of the data wrong, though, so it’s worth double checking the figures before you use them.
  2. Web archives. Want to see what was on a website a few years ago or trying to track down a document that seems to have vanished from the web? Try the Internet Archive Wayback Machine at http://www.archive.org/. Enter the URL of the website or document and you should then see a calendar of the snapshots that the archive has of the site or document. Choose a date from the calendar to view the page. The archive does not have everything but it is worth a try. See also the UK National Archives of old government websites and pages at http://www.nationalarchives.gov.uk/webarchive/ and the UK Web Archive at http://www.webarchive.org.uk/ukwa/.
  3. Statistics sites. Although you can often find statistics via Google, you may find dedicated official statistics sites quicker and more reliable. Some of the sites we covered during the workshops were:

    NHS Statistics Links http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics
    UK National Statistics Publication Hub http://www.statistics.gov.uk/
    Office for National Statistics http://www.ons.gov.uk/
    Welsh Government Statistics http://wales.gov.uk/topics/statistics/
    Welsh Assembly Government StatsWales http://statswales.wales.gov.uk/
    UK Open data http://data.gov.uk/
    Eurostat http://ec.europa.eu/eurostat/
    European Union Open Data Portal http://open-dat.europa.eu/en/
    Zanran http://www.zanran.com/

Top tips from the latest business information workshop

Delegates at yesterday’s business information workshop in London came up with an interesting combination of websites and search commands for their end of day top tips.

  1. OFFSTATS – Official Statistics on the Web  http://www.offstats.auckland.ac.nz Excellent starting point for official statistical sources by country, region subject or a combination of categories. All of the content in the database is in the public domain and available through the Internet.

    OFFSTATS

  2. GMacker http://gmacker.com/web/content/gDateRange/gdr.htm Google’s Verbatim in the search options menu on the results page is great. Google’s date option from the same menu is great. But you cannot use both together. You can use the daterange: command, though, with Verbatim but it’s complicated. GMacker is a much easier way to do it. Type in your search on the GMacker page, select your dates from the calendars and click on ‘Google Search’. When the results appear on Google simply apply Verbatim in the usual way.
  3. Domain Tools http://www.domaintools.com/ A useful tool for identifying who owns the domain name of a website.
  4. 7 side http://www.7side.co.uk/ was recommended for its International company information services.
  5. Zanran http://zanran.com/This is a search tool for searching information contained in charts, graphs and tables of data and within formatted documents such as PDFs, Excel spreadsheets and images. Enter your search terms and optionally limit your search by date and/or format type.
  6. News alerts, news curation services and automated newsletter generation. Use Google alerts, RSS feeds and newsletter generation sites such as Paper.li (http://paper.li/) and Scoop.it (http://www.scoop.it/) to keep up to date and share news with colleagues.
  7. site: command Use the site: command to focus your search on particular types of site, for example site:ac.uk, or to search inside a large rambling site. You can also use -site: to exclude sites from your search.
  8. Numeric range This command is unique to Google. Use it for anything to do with numbers – years, temperatures, weights, distances, prices etc. Simply type in your two numbers separated by two full stops as part of your search. A good way of limiting your search to pages or news covering a company’s activities over two or three years in the past.
  9. OpenCorporates “The Open Database Of The Corporate World” http://opencorporates.com/ Provides access to open corporate data on 55 million companies in 75 jurisdictions. You can search all jurisdictions at the same time or select just one. Results can be filtered by type of data held, current status, company type, SIC. A link to the original registry page for a company is always included with the displayed information.
  10. DataMarket http://datamarket.com/ A portal to thousands of free and priced datasets. Free to search, and create charts and visualisations of the free data.

Top Tips from SWAMP

Swansea_20130624_400
View from Swansea Central Library

Towards the end of June I headed off to Swansea Central Library to facilitate a workshop on search tools and techniques for finding business information and statistics. The session was organised for the libraries of the wonderfully named SWAMP – South West and Mid Wales Partnership.

We had fantastic views from the library of the sea and shore line so they did very well to remain focused on the work in hand. The top tips that the group suggested at the end of the day were a mixture of search techniques and business information sites.

1. Persistence.
Don’t give up and don’t get stuck in a rut. If your first attempts fail to produce anything useful try a different approach to your search. Try some of the tips mentioned below: use advanced search commands, a different search tool or go direct to a website that covers your subject area or type of information.

 2. Verbatim.
Google automatically looks for variations on your search terms and sometimes drops terms from your search without telling or asking you. To beat Google into submission and make it run your search exactly as you have typed it in, click on ‘Search tools’ in the menu above your results, then click on the arrow next to ‘All results’ and from the drop down menu select Verbatim.

3. Private Browsing.
To stop search engines personalising your results according to your previous searches and browsing behaviour, find out where the private browsing option is in your browser (in Chrome it is called Incognito). This ignores all cookies and past search history and is as close as you can get to unfiltered results.

Short cuts to private browsing in the main browsers are:

Chrome – Ctrl+Shift+N

FireFox – Ctrl+Shift+P

Internet Explorer – Ctrl+Shift+P

Opera – Ctrl+Shift+N

Safari – click on Safari next to the Apple symbol in the menu bar, select Private Browsing and then click on OK.

4. The site: command.
Include the site: command in your search to focus your search on particular types of site, for example site:ac.uk, or to search inside a large rambling site. You can also use -site: to exclude sites from your search. For example, if you are searching for information on Wales and Australian websites mentioning New South Wales keep coming up include -site:au in your search.

5. The filetype: command.
Use the filetype: command to limit your research to PowerPoint for presentations, spreadsheets for data and statistics or PDF for research papers and industry/government reports. Note that in Google filetype:ppt and filetype:xls will not pick up the newer .pptx and xlsx formats so you will need to incorporate both into your strategy, for example filetype:ppt OR filetype:pptx, or run separate searches for each one. In Bing.com, though, filetype:pptx will pick up both .ppt and .pptx files.

6. Guardian Data Store (http://www.guardian.co.uk/data/)
For datasets and visualisations relating to stories in the news. This is proving to be a very popular site on both the public and in-house workshops. As well as the graphs and interactive maps the source of the data is always given and there are links to the original datasets that are used in the articles.

7. Company Check (http://www.companycheck.co.uk/)
Company Check repackages Companies House data and provides 5 years of figures and graphs for Cash at Bank, Net Worth, Total Liabilities and Total Current Liabilities free of charge. It also  lists the directors of a company. Click on a director’s name and you can view other current and past directorships for that person.

8. BL BIPC industry Guides
The British Library Business Information and IP Centre’s industry guides at  http://www.bl.uk/bipc/dbandpubs/Industry%20guides/industry.html highlight relevant industry directories, databases, publications and web sites. Excellent starting points if you are new to the sector.

9. Web archives for documents, pages and sites that are no longer “live”.
Most people know about the Internet Archive’s Wayback Machine at http://www.archive.org/and its collection of snapshots of websites taken over the years. There is also a collection of old UK government webpages at http://www.nationalarchives.gov.uk/webarchive/, and the British Library has a UK web archive at http://www.webarchive.org.uk/ukwa/.

10. Keep up to date
Keep up to date with what the search engines are up to, changes to key resources and new sites. Identify blogs and commentators that are relevant to your research interests and subject areas and follow them using RSS or email alerts.

North Wales Libraries Partnership Top Tips

Cyril in the John Spalding Library

The John Spalding Library in Wrexham hosted the North Wales Libraries Partnership (NWLP) workshop “Search is more than just Google”. Delegates from public, government, academic and NHS libraries gathered together to look at the effect of mobile technologies on search, open access, getting better results from Google and alternative search tools. The consensus reached during one of the breaks was that Cyril, one of the library’s residents and pictured on the left, should have ignored Google’s nutrition advice and gone for the more authoritative sources available in the library and on the web. If only he had waited and attended the workshop he would have known exactly where to look!

There was much discussion on how mobile devices change how we can search – not always for the best – and there was concern, as usual, over how much we willingly give away about ourselves to services such as Google and Facebook. Open access was debated in the afternoon along with possible directions for academic publishing.

An edited set of the slides is available on authorSTREAM at http://www.authorstream.com/Presentation/karenblakeman-1856150-search-google/ and Slideshare at http://www.slideshare.net/karenblakeman1/search-is-more-than-just-google.

The Top Tips that the group came up with included some of the usual advanced Google commands but others concerned cloud computing and social media. Here they are.

1. Back up your stuff. Having your data hosted in the cloud means you don’t have to worry about it disappearing when your laptop or server crashes. But what if your cloud service goes under or your account is deleted for some reason? Have you made a local backup of your essential files and treasured family photos? One of the participants mentioned the Library of Congress digital preservation toolkit for preserving family memories (http://www.digitalpreservation.gov/personalarchiving/).

2. Private browsing for “un-personalising” search results. If you want to make sure that your results are not being influenced by past searches and browsing behaviour, find out where the private browsing option is in your browser (in Chrome it is called Incognito). This ignores all cookies and past search history and is as close as you can get to unfiltered results.

3. Change the order of your search terms to change the order in which results are listed. This is an old trick but still seems to work.

4. Use advanced search commands such as site:, filetype;, intext:, to focus your search. Some of the commands are available not just in Google but also in Bing and DuckDuckGo.

5. Create “newspapers” of articles mentioned on Twitter, Facebook, Google+ or news sites by using services such as Paper.li (http://paper.li/). These can be generated from hashtags, keyword searches or your own Twitterstream. Have a look in the Paper.li news stand to see if someone has already created a paper on your topic. Paper.li automatically compiles the newspaper but there are other services such as Storify (http://storify.com/) and Scoop.it (http://www.scoop.it/) that enable individuals to curate the content that appears in their personal newspaper.

6. Guardian Data Store for datasets and visualisations relating to stories in the news (http://www.guardian.co.uk/data). This was so popular that it was mentioned twice for inclusion in the top tips. What people liked about this is that the source of the data is always given and there are links to the original datasets.

7. Million Short http://millionshort.com/. If you are fed up with seeing the same results from Google again and again give Million Short a try. Million Short runs your search and then removes the most popular web sites from the results. Originally, as its name suggests, it removed the top 1 million but the default has changed to the top 10,000. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so “niche” that it never makes it into the top results in Google or Bing. One person loved it because the type of research they do often pulls up pages of Amazon and eBay results in Google. Not a problem with Million Short

8. Google Reading level to change the type of results that you see. Run your search and from the menu above the results select ‘Search tools’, ‘All results’ and from the drop menu ‘Reading level’. Options for switching between basic, intermediate and advanced reading levels should then appear just above the results. Click on the Advanced option to see results biased towards research.

9. Beware fragmented discussions. Articles can be posted and reposted in many different places: blogs, websites, LinkedIn, Facebook etc. with the result that potentially useful and informative discussions are dotted all over the place. Learn how to locate fragmented discussions in your subject area and where they are likely to occur.

10. Try something other than Google. Take a look at the slides for a few(!) suggestions of what you could use.

Business information slides and top tips

My latest business information workshop, organised by TFPL, was held yesterday in London. A large chunk of the session was taken up with exploring and discussing web sites but we also looked at how advanced search options and commands can be used to focus on higher quality business information. An edited version of the slides is available on authorSTREAM at http://www.authorstream.com/Presentation/karenblakeman-1775787-business-information-key-web-resources/ and Slideshare at http://www.slideshare.net/KarenBlakeman/business-information-key-web-resources-19252576.

Towards the end of the afternoon the participants were asked to come up with a list of top 10 tips and tricks. Two more were submitted to me by email soon after, so we have a dozen in total.

1. Verbatim
Google automatically looks for variations on your search terms and sometimes drops terms from your search without telling or asking you. Neither of these are very helpful if you are looking for a company or a person. Quote marks around phrases or individual words do not always force an exact match or inclusion in the search. If you want Google to run your search exactly as you have typed it in, click on ‘Search tools’ in the menu above your results, then click on the arrow next to ‘All results’ and from the drop down menu select Verbatim.

2. Reading Level
Try ‘Reading level’ if Google is failing to return any research or business related documents for a query. Run your search and from the menu above the results select ‘Search tools’, ‘All results’ and from the drop menu ‘Reading level’. Options for switching between basic, intermediate and advanced reading levels should then appear just above the results. Google does not give much away as to how it calculates the reading level and it has nothing to do with the reading age that publishers assign to publications. It seems to involve an analysis of sentence structure, the length of sentences, the length of the document and whether scientific or industry specific terminology appears in the page.

3. Guardian Data Store http://www.guardian.co.uk/data
This section of the Guardian posts articles, charts, graphs and maps on stories in the news using official government data, datasets collected and published relevant organisations and sometimes data obtained via Freedom of Information (FoI) requests. Links to the original datasets are provided so that you can download the raw data.

4. filetype:
Use the filetype: command to limit your research to PowerPoint for presentations, spreadsheets for data and statistics or PDF for research papers and industry/government reports. Note that in Google filetype:ppt and filetype:xls will not pick up the newer .pptx and xlsx formats so you will need to incorporate both into your strategy, for example filetype:ppt OR filetype:pptx, or run separate searches for each one. In Bing.com, though, filetype:pptx will pick up both .ppt and .pptx files.

5. site:
Include the site: command in your search to focus your search on particular types of site, for example site:gov.uk. You can also use -site: to exclude a site or group of sites from your search, for example:

potato yields forecasts 2013 site:gov.uk -site:www.gov.uk

to run the search on UK government web sites but excluding the new www.gov.uk site.

6. Duedil chart
Duedil (http://www.duedil.com/)  is one of several companies that repackage Companies House data and makes some of available free of charge. The workshop participants particularly liked the company Group visualisations.

Duedil company Group visualisation
Duedil company Group visualisation

 

7. Company Check http://www.companycheck.co.uk/ and Company Director Check http://company-director-check.co.uk/
Like Duedil, both of these services repackage Companies House data. Company Check provides 5 years of figures and graphs for Cash at Bank, Net Worth, Total Liabilities and Total Current Liabilities free of charge and lists the directors of a company. Click on a director’s name and you are taken to the Company Directory Check where you can view other current and past directorships for that person.

8. RSS feeds
Several of those attending the workshop already use, or are considering using, RSS feeds as a means of monitoring events and companies. Google is closing down Google Reader but Phil Bradley has lists of alternatives at http://philbradley.typepad.com/phil_bradleys_weblog/2013/03/20-alternatives-to-google-reader.html and
http://philbradley.typepad.com/phil_bradleys_weblog/2013/03/even-more-33-google-reader-alternatives.html

9. BL BIPC industry Guides
The British Library Business Information and IP Centre’s industry guides at http://www.bl.uk/bipc/dbandpubs/Industry%20guides/industry.html highlight relevant industry directories, databases, publications and web sites. Excellent starting points if you are new to the sector.

10. Domain Tools http://www.domaintools.com/
A useful tool for identifying who owns the domain name of a web site. Alarm bells should start ringing in your head if the owner is hiding behind an agent or a privacy protection service.

11. GBRdirect http://www.gbrdirect.eu/
A single point of access to the official company registries of 22 European countries. As well as searching for companies your can search company appointments and personnel for some countries, and verify VAT numbers. The amount of information that is disclosed varies depending on the country and details of what is available is included in the price list at http://gbrdirect.eu/priceList.aspx. The information that it finds will be in the original language.

12. Numeric range
This command is unique to Google. Use it for anything to do with numbers – years, temperatures, weights, distances, prices etc. Simply type in your two numbers separated by two full stops as part of your search. For example: world oil demand forecasts 2015..2030

This workshop is being held again on Thursday, 19th September 2013 in London. The content will have changed by then – in fact, some things have already changed! – and participants are encouraged to let us know the areas and topics in which they are particularly interested and areas of research that cause them problems. This enables me to tailor the event to the needs of those attending. Hands-on practical sessions are included so that everyone has a chance to try out the sites and techniques for themselves. Further details of the day are on the TFPL website.

Top tips for finding research information

Free Search Tools for Finding Research Information

This week I was in Canterbury leading a workshop and discussion on Google and Google Scholar for finding research information. Although the emphasis was on Google we also covered other specialist tools designed to search for scientific and research information. We also had an interesting discussion on h-index, other citation indices and services such as ORCID and ResearchGate. The slides for the session are available on authorSTREAM (http://www.authorstream.com/Presentation/karenblakeman-1706478-google-scholar-research-information/), Slideshare (http://www.slideshare.net/KarenBlakeman/scholar-research-information) and temporarily at http://www.rba.co.uk/as/.

Anyone who has attended one of my workshops knows that I ask the group to propose at the end of the session their top tips. These are the Canterbury group’s top 10 tips.

1. What’s going on?
Try and find out what’s going on behind the scenes and how the different search tools work. For example, Google and Google Scholar are quite different in the way they manage your search. Understanding how they operate means that you can adapt your search strategy accordingly and also manage your expectations; for example Google Scholar does not use the publishers’ meta data so author and date search are unreliable.

2. Personalisation and ‘unpersonalisation’
Google personalises your search based on past activity, who is in your social networks,and a whole host of other ‘stuff’. You can quickly ‘unpersonalise’ your results by using a separate browser window that does not use cookies or your web history as part of the search algorithm.

If you use Chrome as your browser, open what is called an incognito window. In the top right hand corner of your screen there is an icon with three lines. Click on it and from the drop down menu select New incognito window. Alternatively press the Ctrl Shift N keys on your keyboard

If you use Firefox, from the menu at the top of the screen select Tools followed by Start Private Browsing.

In Internet Explorer select Tools followed by InPrivate Browsing. If you cannot see InPrivate under Tools try looking under the Safety option.

3. Advanced search commands
Use Google advanced commands  such as filetype: to focus on PDFs, presentations, spreadsheets containing data and site: to look for information on just one site or a range of sites such as UK government. Although the advanced search screen has boxes for you to fill in for the commands the file format or filetype option is limited. It does not include options for the newer Microsoft Office formats such as .pptx and xlsx. Use filetype: as part of your search strategy, for example:

nasa dark energy dark matter filetype:pptx

Google Scholar commands are more limited – see slide 28 of the presentation.

4. intext:
Google automatically looks for variations on your terms and sometimes omits words from your search if it thinks the number of results is too low. Prefixing a term with intext: tells Google that it must be included in your search and exactly as you have typed it in. For example:

UK public transport intext:biodiesel statistics

tells Google that biodiesel must be included in the search and exactly as typed in.

5. Reading Level
Use Reading level if Google is failing to return any research oriented documents for a query. Run the search and from the menu above the results select Search toolsAll results and then from the drop menu Reading level. Options for switching between basic, intermediate and advanced reading levels should then appear just above the results. Google does not give much away as to how it calculates the reading level and it has nothing to do with the reading age that publishers assign to publications. It seems to involve an analysis of sentence structure, the length of sentences, the length of the document and whether scientific or industry specific terminology appears in the page.

6. Date options
In Google web search, use the date options in the menus at the top of the results page to restrict your results to information that has been published within the last hour, day, week, month, year or your own date range. Click on Search tools, then Any time and select an option. This works best with news, discussion boards, and blogs and web sites that use blogging software  to generate pages but Google is getting better at identifying the correct date of a web page.

Google Scholar handles publication dates differently. On the results page you can select a date range from the menu on the left hand of the page. Alternatively, you can run a Google advanced search and enter your publication years. However, Google Scholar looks for publication years in the area of the document where the date is most likely to be. As a result it may identify a page number or part of an author’s address as a year!

7. Google Scholar alerts
To be used with caution as the searches periodically stop without warning, and so have to be set up again, and they sometimes include documents that are several years old. Whatever your search you can set up an alert by selecting Create alert from the menu on the left hand side of the results page.

If the author has created a profile on Google Scholar, from their profile page you can follow new articles and/or new citations for that author. From past experience I warn you that this is not entirely reliable.

Google Scholar Follow Author

8. Metrics – top publications
Although it claims to search all scholarly literature Google Scholar does not always cover all of the key journals in a subject area. There is no complete source list but there is a top publications for subjects and languages under the ‘Metrics’ link in the upper right hand corner of the Scholar home page.

9. Microsoft Academic Search – visualisations
Microsoft Academic Search (http://academic.research.microsoft.com/) is a direct competitor to Google Scholar. The site is sometimes slow to load and it often assigns authors to the wrong institution. Nevertheless, the visualisations such as the co-author and citation maps can be useful in identifying who else is working in a particular area of research. The visualisations can be accessed by clicking on the Citation Graph image to the left of the search results or author profile.

Microsoft academic search citation graph
Author Citation Graph


10. Mednar visual
Deep Web Technologies has developed in conjunction with various institutions a number of science and research specific portals, some of which are publicly available. The sources that they cover are different but they all have similar search and display options. Results are automatically ranked by relevance but this can be changed to date, title or author. In addition to the standard relevance ranked list of results the portals create clusters of topics on the left hand side of the screen. The topics include broad subject headings, authors, publications, publishers, and year of publication and are a useful tool for narrowing down a search. Some of the portals, such as Mednar (http://mednar.com/), offer a clickable ‘visual’ of topics and sub-topics.

Mednar Macular Degeneration Visual

Top tips for business information

Here are the Top Tips for business information compiled by the participants of my latest business information workshop held on November 15th, 2012 in London. The set of slides that was the starting point for the workshop can be found on authorSTREAM at http://www.authorstream.com/Presentation/karenblakeman-1601945-business-information-key-web-resources/

  1. Zanran http://zanran.com/ A search tool for  identifying charts, graphs and tables of data within formatted documents such as PDFs, Excel spreadsheets and images. Enter your search terms and optionally limit your search by date and/or format type.  Zanran comes up with a list of  documents that match your criteria with thumbnails to the left of each entry. Hover over the thumbnail to see a preview of the page containing your data and further information on the document. Very useful if you are looking for industry statistics.

Zanran

  1. University library subjects guides. If you are looking for some good starting points on a subject seek out some university library subject guides. These list resources that are only available to their own students and staff but may also include links to relevant publicly accessible resources that have been assessed for quality.
  2. Socialmention http://socialmention.com/ Several social media search tools were covered in the workshop but this one received a special mention as a good general all round social media tool. It covers images, blogs, Twitter, Facebook, audi0 and bookmarks. If you are monitoring a topic you can set up email and RSS alerts.
  3. Companies House http://www.companieshouse.gov.uk/ The official registry for UK companies. Other services such as Company Check (http://companycheck.co.uk/) and DUEDIL (http://www.duedil.com/), which repackage Companies House data, may provide more information free of charge but it is always worth double checking with Companies House to see if there is more up to date information and to get a full of list of the documents that are available on a company. The history and list of documents that can be ordered for a company is informative in itself. On the Companies House web site use the Find Company Information to locate the company in the register and then click on “Order information for this company”. You will then see a list of available documents. Titles such as “Struck off and dissolved” and “Application for administrative restoration” would suggest that perhaps you ought to investigate further before doing business with the company.
  4. LinkedIn groups A couple of the workshop participants regularly use LinkedIn groups for research questions. Look for groups set up by professional and official bodies relevant to your subject.
  5. Twitter If you are looking for a professional, research or trade association that may be able to help with your research you only need to find just one organisation on Twitter covering your topic. Then, to find others that might be useful, see who that organisation is following.
  6. Millionshort http://millionshort.com/. If you are fed up with seeing the same results from Google again and again give Million Short a try.  Million Short runs your search and then removes the most popular web sites from the results. Originally, as its name suggests, it removed the top 1 million but the default has changed to the top 10,000. The principle remains the same, though.  Exclude the more popular sites and you could uncover a real gem. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so “niche” that it never makes it into the top results.
  7. Biznar http://www.biznar.com/ Biznar is a federated search engine that runs your search in real-time in about 70 resources. There is a list on the Advanced Search screen where you can deselect individual or groups of resources. The results are combined into a single list and organised on the left hand side of the screen into folders such as Topics, Authors, Publications, Publishers and Dates. These are computer generated but can help you narrow down your search. A bit erratic at times and sometimes comes up with odd results but people still thought it was worth including in the Top Tips list.
  8. DUEDIL http://www.duedil.com/. This service repackages Companies House data and provides some of it free of charge. The feature that won DUEDIL a place in the Top Tops is the “Group” visualisation that illustrates the connections between the company you are researching, its parent companies and subsidiaries. You have to create an account (free at the moment) to access all of the information.

DUEDIL

  1. SCoRe http://www.score.ac.uk A catalogue of current and historical printed company reports held in UK libraries. The catalogue does not provide links to digitised documents but is a very quick and easy way of identifying libraries that hold hard copy reports. The participating libraries include London Business School, the British Library, Manchester Business School, City Business Library, Guildhall Library, Strathclyde University and the University of Warwick. A full list is available at http://www.score.ac.uk/collections.asp.