Category Archives: Search Engines

Anything but Google – URLs

I omitted to include the URLs of some of the specialist tools mentioned in the Anything but Google presentation. You could Bing or Yahoo the names of the services (we’re not going to Google them are we?) but to save time I’ve listed them below.

ChemSpider – Database of Chemical Structures and Property Predictions
http://www.chemspider.com/
Owned by the Royal Society of Chemistry Chemspider links together compound information across the web and provides free text and structure search of millions of chemical structures. Search by systematic name, synonym, trade name, registry number, SMILES or InChI.

Biznar http://biznar.com/
Live federated search from Deep Web Technologies and covering 60 business collections. As well as presenting you with a standard list of results, the pages are organised into folders on the left hand side of the screen covering topics, authors, publications, publishers and dates (years).

TechXtra http://www.techxtra.ac.uk/
This is an initiative of Heriot Watt Universit providing a free service for finding articles, books,industry news, job announcements, technical reports, technical data, full text eprints, thesis and dissertations in engineering, mathematics and computing.

Scirus http://www.scirus.com/
Owned by Elsevier, Scirus covers scientific information. (See the About Us section for the full details). Some of the information is from free web resources but it also includes many priced articles.

PhilPapers: Online Research in Philosophy http://philpapers.org/
Directory of online philosophical articles and books by academic philosophers. Its purpose is “to facilitate the exchange and development of philosophical research through the Internet. Our service gathers and organizes philosophical research on the Internet, and provides tools for philosophers to access, organize, and discuss this research.”

Microsoft Academic Search http://academic.research.microsoft.com/
Currently concentrates on chemistry, computer science, engineering, mathematics and physics. It has advanced search options that actually work (unlike Google Scholar!), lists citations and has a wonderful Visual Explorer.

Not mentioned in the slides but discussed briefly during the session was HealthMash http://healthmash.com/. A semantic metasearch health search engine with “clustering and advanced linguistic capabilities.” I’d be interested in people’s experiences and views of this one.

Oi! Google – you have seriously overstepped the mark

Yes, I am talking to you Google and  this time you really have gone too far.

All I wanted to do was check up on the background of a photo I had taken of the wall surrounding the graveyard of a church in Reading. The church in question is St Laurence. We have all become accustomed to the “Did you mean….?” option at the top of our search results. I found it invaluable early in the morning or late at night when typos were inevitable in my search strategy: yes, thank you, I really did mean ‘widget manufacturers’ and not ‘wigdet manufacturers’. Recently, though, Google has abandoned the optional corrected search and now runs instead the corrected strategy as the default with yours as the extra option. Google has taken this a stage further and runs your search as it thinks fit.

So Google decided that I really meant to search for Saint Lawrence and has included that in the search. There is no option to search on just Saint Laurence:

Google St Laurence search

On this occasion there were some relevant pages in my results. But yes, Google, I really did want to search for Saint Laurence! Now, it seems, I have to prefix all of my search terms with a plus sign or enclose them in double quote marks to stop Google’s dictatorial behaviour.  But why should I have to do that?

In one of my presentations last year on Google vs. Bing/Yahoo I commented that Google would have to do something really stupid before users would switch to another search engine. For me, Google has done that really stupid thing. I am now seriously contemplating switching search engines for basic web searching. My final decision will be based on relevance of results and how quickly they are delivered. I have to spend too much time and click too many times to get them on Google

UPDATE: It has just got worse. I tried a search on the phrase “Saint Laurence” thinking Google would carry out an exact match search, but Google will have no truck with such obvious ploys. (Ignore the Twitter search at the top of the results screen – that is a Greasemonkey script add-on for FireFox).

Google search changes

I now have to click on the option for “Saint Laurence” to get results for the search I had originally requested. Putting a plus sign before my phrase in the search box does not change Google’s mind. “Excuse me, Google, but I do know what I am doing and when I tell you to carry out an exact match search I WANT AN EXACT MATCH SEARCH! Got it?”

ILI2010 – social search presentation

My presentation on using social media search tools as part of research, which I gave at Internet Librarian International on 15th October 2010, is now available on the sites listed below. I have uploaded it to several different sites and services as I know some of you are not able to access one or more of them at work.

PowerPoint Presentation (6.3 MB) (download from rba.co.uk site)
authorSTREAM
Slideboom
Slideshare

If you want to catch up with #iili2010 tweets there is a Twapperkeeper at http://twapperkeeper.com/hashtag/ili2010

Zuula – new interface and de-duplication

Zuula (http://www.zuula.com/) has a new interface and a new feature. Zuula provides an interface to many different search engines organised by type. Simply enter your search strategy, click on the type of information you want (web, image, news etc) and then click on the tabs of the search engines one by one to see their results. It is a quick and easy way to run a basic search through several tools in succession.

Zuula’s new interface is slicker and now automatically de-duplicates web search results. The first in the list is Google and you will notice that the results are numbered. Click on your next choice and you may notice that the numbered results do not start at number one.  At the top of the results list there is a plus sign and the text “Why minimized?” Zuula compares the results with your previous choice and “minimizes” duplicates under the plus sign. To see those results, click on the plus sign.

The other search types do not seem to support de-duplication but some are pulling in additional search features on the results page. For example images offers size, content (face, photo, illustration, line drawing) and colour. Some of the blog options offer restrictions by date (anytime, last day, last week, last month, last year).

You can change the order of the search engines under Preferences and also increase the number of results per page to a maximum of 60.

If you haven’t tried it out already give Zuula a go now.

Advanced search tips and tricks

An interesting list of search tips came from the participants of the search workshops I recently ran in-house for a well known academic institution. (My Twitter followers will be able to work out who it was). As well as being experienced, savvy searchers they are fortunate in that they can choose which browser to use for searching. Attempts to demonstrate Google Instant failed, however. I was not able to show Google’s latest “enhanced search experience” in action, even when using the latest versions of the browsers and being signed in to a Google account. This was probably due to their firewall. Personally, I think that is a plus for the institution. Some of you may disagree.

Here is their combined top search tips list.

1. Keep it Simple!

There is a plethora of advanced search options and Google alternatives but starting off with a simple search string is often the best approach. Looking for data on the UK rat population? You might be tempted to include a file format limitation in your search and/or a site:gov.uk command but simply typing in a search for uk rat population statistics was quicker and came up with the relevant information. Note: the simple approach worked at the time with this example because it was a “hot topic” in the UK news. It might not work now, which brings us to number 2…

2. Be aware of personalisation and hot topics

The major search engines monitor what you search for and the links you click on, and use this to “personalise” your results and sponsored links/ads accordingly. This information is stored in cookies on the computer you used for the search. They also try and work out your location from your IP address so that they can deliver local content (this sometimes goes horribly wrong!). What is currently hitting the headlines will also be a factor in determining the results that are displayed on the first page (increase your displayed results per page to more than the default 10 and ideally to at least 50). This means that you will see different results from one day to the next and if you use a computer other than your usual machine.

3. Google isn’t infallible

We covered a range of search techniques that you can try to bring Google to heel but if you are not getting anywhere try another search tool. Google does not cover everything and your best result may be number 1,200,675 in the results list. Try Yahoo or Bing as alternatives and also think about using specialist search tools for real time and social media, images, and subjects/industries.

4. Get to know the Google alternatives

There is no easy way to do this but visiting Zuula (http://www.zuula.com/) or Browsys Finder (http://www.browsys.com/finder/) once very couple of weeks will remind you of the alternatives and alert you to new kids on the block.

5. Google additional search options

Open up and explore the additional Google search options on the left hand side of your results page. You can restrict your search to news, videos, blogs, images etc and to a time period. There are also options for related searches, less or more shopping sites and….

8. The Wonderwheel

Use this to extract phrases and concepts from the top results and to change the direction of your search. Worth investigating if you are stuck in a rut and fed up with seeing the same results again and again.

9. Google Public Data Explorer

This is currently a Google Labs project at http://www.google.com/publicdata/home “..makes large datasets easy to explore, visualize and communicate. As the charts and maps animate over time, the changes in the world become easier to understand.” There is a list of sources at http://www.google.com/publicdata/directory but the data available is more varied than the list suggests at first glance. The World Development Indicators and OECD Factbook are worth looking at in more detail to see if they have data that can help with frequently asked questions.

10. Creative Commons and public domain images

If you are looking for an image for a presentation or promotional literature, search for images that have the appropriate Creative Commons license. There are several licenses with varying degrees of restrictions. Details are on the Creative Commons web site at http://www.creative.commons.org/.  You can search Flickr photos that have a specific creative commons license at http://www.flickr.com/creativecommons/ or use Compfight (http://www.compfight.com/). There are several other sites you can use for Creative Commons images but Geograph (http://www.geograph.org.uk/) was mentioned several times by the workshop participants. Geograph “aims to collect geographically representative photographs and information for every square kilometre of Great Britain and Ireland” and all photos have a CC 2 license, which means that they can be used commercially with attribution.

11. TinEye Reverse Image Search
http://www.tineye.com/

Type in the URL of an image or upload one of your own and TinEye will find similar images, how it is being used, if modified versions of the image exist, or if there is a higher resolution version. Provided by Idée Inc who also offer..

12. Multicolr Search Lab
http://labs.ideeinc.com/multicolr/

Search 10 million Creative commons Flickr images by colour. You can specify more than one colour and click on a colour several times to increase its prominence within the image. You can easily click through to the original Flickr image to double check the license.

13 . Slidefinder

http://www.slidefinder.net/

Ideal for locating individual presentation slides that contain your search terms. There is an Advanced Search that enables you to search specific areas of a slide for example title, text, notes. You can also limit your search to a university. There are browsable lists at the bottom of the page but they do not list every institution: there are only 47 for the UK. One workshop participant had been given a paper copy of a complex slide and it had taken her “ages” to find an electronic version. She had had to wade through hundreds of slides in presentations that had been identified by using the advanced filetype: ppt search. Slidefinder found it straight away.

14. Twitter search tools

Do not expect Google, Yahoo or Bing to carry out a reliable Twitter search. Use specialist search tools such as Twitter Search (http://search.twitter.com/), Twazzup (http://www.twazzup.com/), BackTweets (http://www.backtweets.com/) for tweets that refer to your content, Tweepz (http://www.tweepz.com/) for finding people and organisations on Twitter, and TwapperKeeper (http://www.twapperkeeper.com/) for archives of tweets on a conference hashtag or keyword.

15. Google custom search engine

http://www.google.com/cse/

Ideal for groups or collections of sites that you regularly search and use. Google CSE is very quick and easy to set up and can be hosted on Google. Two that had been set up by a workshop participant were a list of library associations worldwide and selected UK higher and further education web sites.

16. Watchthatpage

Tracking changes to web pages that do not themselves offer RSS or email alerts was not covered by the main part of the workshop but the question arose during one of the practical sessions. There is a list of some web based and downloadable programs and their features at Tracking Web Page Changes http://www.rba.co.uk/sources/monitor.htm . Watchthatpage (http://www.watchthatpage.com/) won the vote because it is free, web based and offers email alerts.

17. Evernote

http://www.evernote.com/

“Capture anything… Type a text note. Clip a web page. Snap a photo. Grab a screenshot. Evernote will keep it all safe.”. I don’t use this myself but it had several fans in this organisation. ( I use Firefox add-on Scrapbook to do a similar thing).

18. Add-ons for Firefox

If you are a Firefox user explore the many add-ons that are available to make searching and managing information easier. For example Feedly (https://addons.mozilla.org/en-US/firefox/addon/8538/) to organize your favourite sources into a magazine-like start page;  Scrapbook (https://addons.mozilla.org/en-US/firefox/addon/427/) to save and organize web pages; and Optimize Google (https://addons.mozilla.org/en-US/firefox/addon/52498/) for customizing your Google searches and results.

19. Don’t re-invent the wheel – re-use and share

As well as images, many presentations have Creative Commons licenses and their authors are often happy for you to re-use slides from them as long as you acknowledge the source and do not incorporate them into a product or service that you then sell. Slideshare.net is a good starting point but do check the license to confirm what you can and cannot do with the content – not all are CC. Also, consider assigning a CC license to your own photos and presentations. The Creative Commons web site (http://creativecommons.org/choose/) can help you decide which one to use.

20. Time to explore

There was time to explore new techniques and tools during the workshop but it is not so easy to try out, for example, a new option on Google when you are back in the office and an enquirer wants that result NOW! Try and incorporate some “play time” into your schedule so you can keep up with new developments, even if it is just 10 minutes a week.

Seriously irritating things about Google Instant

Having had a few more hours to explore Google Instant there are four things that I find seriously annoying about it:

1. The way the suggestions and results are displayed is so messy and busy. AlltheWeb’s LiveSearch implementation was so much slicker and easier to follow. A pity that Yahoo did not follow through on that one but they never have taken really good experimental stuff further.

2. You only get 10 results per page regardless of what you have on your Settings page. This is a major problem for me because I have my display set to 100. I don’t trust the first  results in a Google search to be – er, how shall I put it – unbiased, and I want to be able to quickly scan through at least 30 or 40 to get an indication of whether or not I need to modify my strategy. Having to keep clicking for the next page is going to drive me up the wall. I can understand, though, that allowing everyone to have more than 10 results per page would probably slow down the processing and display of results.

3. The Wonderwheel has gone from Google Instant results. I don’t use this feature that often but it does sometimes help me narrow my search or to branch out in a completely different direction.

4. It messes up several of my Firefox add-ons, in particular OptimizeGoogle. Google SearchWiki (now defunct) did exactly the same.

I have now turned off Google Instant. It offers me no benefits that compensate for the loss of features and options.

Many people are also complaining that the ability to turn off query suggestions has now disappeared (thanks to Paul Chapman for bringing this to my attention – see his comment to my initial Google Instant  review  at http://www.rba.co.uk/wordpress/2010/09/09/google-instant-display-results-as-you-type/comment-page-1/#comment-8637). You can still do it if you use Google SSL at https://encrypted.google.com/ but that is no help whatsoever if you want to use a country version of Google as I often do. To be honest I rarely pay any attention to the suggested queries and most of the time I start my search from the Google Toolbar where I have suggestions turned off. But if you really do not want query suggestions or it causes technical problems, and Google does not reinstate the turn-off option, the main alternatives are Yahoo or Bing. Both still allow you to switch it off.

Google Instant – display results as you type

No, Google hasn’t branched out into groceries – yet. Google Instant is not a brand of coffee but a new search and display feature that shows changing results as you type your search. Google says that it is actually display before you type because it tries to predict your full strategy and delivers results for that search. As you add more terms the predictions and the results change:

“Google Instant is search-before-you-type. Instant takes what you have typed already, predicts the most likely completion and streams results in real-time for those predictions—yielding a smarter and faster search that is interactive, predictive and powerful.

The list of predicted searches – they are the same as Google Suggest – appears below your search box. If you spot a better strategy you can scroll down the list to select it.

Google Instant

I found that Google does eventually run out of predictions. In some cases it was after only three terms: in others it took seven or eight before Google gave up but carried on changing the results as I typed in extra terms. If you are a more experienced and advanced searcher who uses search commands such as ‘filetype:’ or ‘site:’ you are suddenly presented with a blank page. This totally confused me at first and I thought that Google simply did not have any results for my search. In these situations Google reverts to ‘old style’ search, so just carry on as normal and press enter to view your results.

Note: You have to be signed in to your Google account to see Google Instant.

Not everyone will have Google Instant right now:

Google Instant will become the core search experience on Google.com for Chrome, Firefox, Safari and IE 8. We’ll also be offering Google Instant to our users in France, Germany, Italy, Russia, Spain and the U.K. who are signed in and have Instant-capable browsers. Over the coming weeks and months, we’ll work to roll out Google Instant to all geographies and platforms.”

I am guessing that IE 6 is not included in the “all geographies and platforms” as Google has already withdrawn support for it in some of its other services, for example YouTube.

The idea is not new. AlltheWeb – owned by Yahoo – was trying out a similar approach with its Livesearch a few years ago. I found it extremely useful because you could quickly spot if you had a gone a search term too far. The progression might go: OK-ish results, relevant, even better, superb, total rubbish. It was then easy to remove the last term you had typed in to get back to your superb results list. When further development of AlltheWeb stopped Livesearch was discontinued.

Alltheweb Livesearch

Another good idea abandoned by Yahoo and later taken up by someone else. Some of you may also remember Yahoo Mindset which gave you a slider bar to change the emphasis of your results to find more shopping or research oriented pages. Google now has a fewer/more shopping sites option in the left hand menu on its web results pages.

My first impressions are mixed. Sometimes the predictions work, sometimes they don’t and I don’t find it as easy to take in the changing display as I did with AlltheWeb Livesearch. I think that is because Livesearch had the search box on the left hand side of the screen and I find it easier to glance across the page rather than down to monitor what is happening to my search.

Find Google Instant distracting and want to turn it off? Either sign out of your Google account or click on the Settings link in the top right hand corner of the screen. The option to turn it off is at the bottom of the Settings screen.

Further information is available on the Official Google Blog –  Search: now faster than the speed of type
http://googleblog.blogspot.com/2010/09/search-now-faster-than-speed-of-type.html

IFEG Advanced Search, Statistics & Market Research

I have now uploaded the slides for my workshop at the Information for Energy Group (IFEG). As usual, I have uploaded them to several different web sites in case one or more are blocked by corporate firewalls. If you have problems accessing any of the locations, let me know and I’ll sort out some other means of getting the presentation to you.

Workshop: Advanced Internet Searching for Energy Information & Market Research
Organised for:
Information for Energy Group
Venue: The Energy Institute, New Cavendish Street, London.
Date: Thursday 13 May 2010

PowerPoint Presentation (download from the RBA site – 7.5 MB)
authorSTREAM
Slideboom
Slideshare

Another workshop – another Top 10 Search Tips

The participants at the latest advanced search workshop were all from the public sector and had very strong views on some of the new developments in search. They were definitely not impressed by Google automatically enabling web history with a view to “personalizing” search results. (See Your Google results are about to get weirder
http://www.rba.co.uk/wordpress/2009/12/17/your-google-results-are-about-to-get-weirder/). (The workshop participants  are switching off Web History as soon as they get back to the office!) There were several sites and search features, though, that did impress them. This is their list of Top 10 Search Tips.

1. The Google Wonderwheel was the clear winner of the day with this group. When your results page appear on screen, click on “Show options” just above the results and to the left of the screen. Then select Wonderwheel from the list on the left of the page. (For further details see Google new search and display options
http://www.rba.co.uk/wordpress/2009/10/05/google-new-search-and-display-options/)

2. Google’s Timeline was a close second in the popularity stakes. This is also under Show options in Google when you do a default web search and is also available in Google News. It shows the distribution of your articles over time and gives you an idea of when something started to become a “hot topic” and how a story has developed over time. It is not 100% accurate but is good enough to give you an overall picture of how interest in a subject has waxed and waned.

3. LGSearch http://lgsearch.net/ They liked this one a lot! This a Google Custom Search Engine (CSE) set up by Dave Briggs (http://davepress.net/) that searches UK public sector web sites in one go. On the results page you can, if you wish, narrow down your search further to Local Government, Central Government, Health, Police & Fire, LG Related or Social Media.

4. Slideshare http://www.slideshare.net/. A site used by many people and organisations to provide access to PowerPoint presentations. Search for presentations on any topic or by a specific person then view online or download the original if the author permits. Once you have selected a relevant presentation Slideshare also shows you a list of other presentations containing similar content. No registration required if you just want to search.

5. Try something else other than Google. As well as giving Yahoo or Bing a go, try and think about the type of information you are looking for: news, video, statistics, what people are talking about. Then use the appropriate search tool for that type of information.

6. Twitter search http://search.twitter.com/ You may not want to indulge in Twitter yourself but it can give you an idea of what people are saying about a topic. It is also an essential part of reputation monitoring and competitive intelligence: what are people saying about you or your products and services? You do not have to have a Twitter account to search Twitter, just go to search.twitter.com.

7. Google Blogsearch (http://blogsearch.google.com/) and Blogpulse (http://www.blogpulse.com/) Blogs are another useful source of views and opinions on every topic imaginable. Blogpulse has a “trend this” option on the results page that displays a graph showing you how many blog posts mention your search terms over time.

8. Zuula.com (http://www.zuula.com/) for quick and easy access to a wide range of search tools covering different types of information. Enter your search once, click on the tab for the type of resource (video, images, reference, news), and then work your way through the list of search engines.

9. Google Custom Search Engines (CSE). We looked at several Google CSEs, LGsearch.net and Directionlessgov (http://directionlessgov.com) being just two of them. You can, though, set up your own CSE at http://www.google.com/cse/. Useful if you search the same web sites day after day. You will need a Google account or Gmail account to set up a CSE but you can host your CSE on your own web site or on Google. CSEs can be made public or kept private.

10. University of Auckland Official Statistics (OFFSTATS)  http://www.offstats.auckland.ac.nz/ This set of web pages provides information on Official Statistics on the Web and is an excellent starting point for official statistics by country and subject/industry.