Category Archives: Search Engines

Exalead changes filetype commands

If you are a user of Exalead (http://www.exalead.com/search/) and use the filetype command you will need to make note of some changes to the file extensions. If you are looking for Excel spreadsheets you will now have to include ‘filetype:excel’ in your search strategy, for PowerPoint it is ‘filetype:powerpoint’ and for Word documents type in ‘filetype:word’. I assume that the changes are to ensure that the ‘new’ Microsoft Office 2007 extensions pptx, docx and xlsx are picked up. Alternatively, you could run just a keyword search and select the filetype from the menu down the right hand side of the results page.

In Google you have to run separate command line searches if you want to pick up both ppt and pptx files. The advanced search screen file format drop-down menu options only search for pre Microsoft Office 2007 file extensions. Bing does not seem to recognise the newer file extensions at all but you can search for them in Yahoo using the ‘originurlextension:’ command. Like Google, Yahoo’s advanced search screen file format box does not pick up the 2007 extensions.

Most people who use Microsoft Office 2007 generally convert files to 97-2003 format before uploading them to the web, but Office 2010 is well into beta testing and the new extensions will start to become more commonplace. It will be interesting to see if and how Google, Yahoo and Bing manage search for these new filetypes.

Online Information 2009 presentations

The three  presentations I gave at Online Information 2009 are now available on Slideshare:

Bing integrates Wolfram Alpha and out of beta in UK – allegedly

Hitting my RSS feeds this morning was the announcement from Bing that their UK version is out of beta. “So what,”  I thought. “Doesn’t look any different to me this morning”. But looking at the announcement in full I see that there is a plethora of new services that I can now enjoy. To start with:

“When you search for Football, what kind of answers do you expect to find. Well, I guess it depends on where you are doing the asking, if you are in the UK you probably don’t want to see NFL schedules. You probably mean what we in the US call soccer. Well today, millions of searchers in the UK can rest assured that Bing knows what they are talking about. We are excited to announce today that Bing in the UK is shedding its beta tag.  We want to congratulate our pals over in the UK on a huge milestone. You can now use Bing to make faster more informed choices on a daily basis.  Oh, and the next Manchester United game is on the 21st of November at 17:30 GMT (that’s 5:30 p.m. for us Yanks), in case you were wondering.”

My first reaction was that I wanted to be sick: I found this so patronising. We in the UK should be so grateful that Bing has finally realised that we have a life separate from the US and that Bing has taken the trouble to find out what we mean by football. Sorry, but I am not at all interested in football so if this is all you are offering as UK customisation then I nominate you for the #epicfail awards.

But let’s not be hasty. Let’s look at what else they have to offer.

“The daily Bing Homepage image and hotspots are something that now will be localized in the UK, with unique imagery and hotspots.”

At last!  The photos on the home page are of landmarks and locations in the UK and not of the Galapagos islands or Mongolia. This morning we had the Angel of the North and then the Avebury stones. The ‘hotspots’ option is now working and if you are interested you can find out more about the subject of the photo.

Bing UK

“Visual Search using visual images and metadata to make search more visual and more compelling.”

Pathetic! For a start there is no visual search on the home page. You have to click on the More option, which takes you to a page where it is listed. They seem to have deleted US stuff and and given us UK politicians under famous people, and Premier League Football Players and Professional UK football clubs under Sports. That is it. Where are the rugby clubs and cricket?  Oh, and under ‘More’ we have ‘Yoga poses’. Now I wonder why that is there? Could it be anything to do with the fact that there are only pictures of photogenic girlies in interesting poses that might possibly attract a lot of visitors to the site? Surely not. How about some gorgeous male hunks in interesting poses?!

“More Instant Answers. Get quick response answers and results to searches, such as how is Liverpool doing in the Premiership or which tourist attraction should I take my in-laws to at the weekend?”

Yet more football, but I thought I would try out their own search “How is Liverpool doing in the Premiership”. Bing did not come up with any easy to find information on this (I was assuming that the searcher would want to know where Liverpool is in the League Table). Google, however, had the official site of the premier league at the top of the results, which has a link to the current league table positions of all of the clubs.

Bing results

BingAnswers

Google results

BingvGoogleAnswers

“See who or what is being chatted about real-time with a global live Twitter feed with Bing Twitter search.”

#epicfail yet again I’m afraid. You have to know the URL of the Bing Twitter search because it is not listed on the UK home page or under ‘More’. Do not be too disappointed because it is a waste of server space, processing time and your time: see my blog posting Twitter search in Bing and Google

“Looking for the best deals?  – There is now an integrated shopping experience with Ciao UK. With Bing you can search the Internet to find the best prices, reviews and local availability.”

Bing didn’t do too badly on this one. We need a new frying pan and it came up with sensible results apart from the Keith Floyd biography “Out of the Frying Pan”. The best link, though, was one of the adverts for John Lewis.

BingShoppingUK


“With insights from our Multimap users, Bing Maps now offers new map styles, imagery and transit integration as well as draggable routes.”

In general the maps are fine. The Bird’s Eye imagery, which is equivalent to Google’s Satellite view, is higher resolution than Google’s and sometime more up to date. The ‘find a business option’ is as incomplete as Google’s. If you want to locate pubs, restaurants, plumbers etc in an area then go direct to Yellow Pages or Thomson Local. The directions for walking from my house to Reading railway station were sensible but it failed when I asked for Manchester Piccadilly railway station to Manchester Business School  (Google Maps had no problems). In fact, Bing Maps could not find Manchester Business School in any shape or form. As for “draggable routes” – no sign of them here.

“Bing has been built for the UK to help consumers get to key local sites and services in fewer links by including popular links, search boxes and suggestions within best match.”

If you are interested in football and shopping, then that might be true. It is certainly better than the US-centric stuff but overall still nowhere near as relevant as Google’s results.

Let’s move on to the announcement that Bing now incorporates results from Wolframalpha ( How Many Calories in a Burger? What’s 2^2^2^2^2? Bing and Wolfram|Alpha Have the Answers). This won’t take very long because I could not get it to work. I even tried the examples they give with the UK and the US versions of Bing and Wolfram Alpha is nowhere to be seen in the results. Has anyone managed to get this to work as described or has the integration not actually happened yet?

Having spent most of the morning struggling with Bing’s new features, and in some cases failing to find them at all, I was beginning to wonder if I had dreamt the dozens of announcements that littered my RSS feeds. I double checked and they are definitely there. Perhaps it’s a tech issue? I’m running Windows 7 on my main machine but the results are the same on Windows XP, and it makes no difference whether I run Firefox or IE. So I can only draw the conclusion that yet again Microsoft Bing has made a complete [expletives deleted] mess of everything. We could do with another half decent alternative to Google but Bing is just not in the same league.

    Internet Search: a challenging and ever changing landscape

    CILIP in the Thames Valley evening meeting

    Date & Time: Tuesday 6th October 2009,  1800 for 1830 hrs
    Location: Great Expectations, 33 London St, Reading

    Google threatens to go hyper with its “caffeine” search. Bing is taking over Yahoo. Image search options are expanding: creative commons, colour, similar images. More specialist search tools for the “hidden web” are emerging and Web 2.0 is now an essential part of the search mix. Karen Blakeman will look at the new services that are being pushed out by the major search engines and the alternatives.

    This is a free event followed by free refreshments and networking opportunities with colleagues.

    An invitation is extended to anyone with a professional interest in the topic

    Contact: Norman Briggs, nwbriggs@pcintell.co.uk to advise attendance for catering purposes.

    Directionlessgov: compare Directgov search with a Google custom search

    Checking through the last year of postings on my blog I regret to say that I somehow managed to forget to cover the excellent Directionlessgov. This is an “alternative” search option for the Direct.gov.uk web site and uses Google. It has has been set up by the group that is also behind They Work For You and The Government Says.

    “We got so fed up with the general uselessness of the multi-million pound shambles otherwise known as the Direct.gov.uk portal, that we decided to build something better in under an hour. Sadly, we ran catastrophically behind schedule, but we still finished before lunch.”

    Type in your search and view the results from Google on the right hand side of the screen. For comparison, and to rub salt and a hefty dose of chilli pepper into the wound, results from Directgov are displayed on the left hand side.

    A search on my own council’s (Reading) recycling policies came up with the following results:

    directionlessgov

    No contest: Directionlessgov wins outright!

    For another Google custom search engine covering local government, you might also like to try LGSearch, which I covered in March 2009.

    Searching for file types made easy

    One of the Top 10 Tips that participants of my advanced search workshops regularly come up with is using file format options to focus your search. If you are looking for an expert on a topic, a conference presentation or a quick overview of a topic then seek out PowerPoint files; government and industry reports are often stored as PDFs; and substantial collections of statistics may be left in Excel format. Both Google and Yahoo have options for file type searches on their advanced search screens, but if you want a quick and easy way of searching both of these search tools for the four main file types (Word, Excel, PDF, PowerPoint), then head for DocJax.

    Simply type your search terms into the box and DocJax will pull up a list of all four file formats in Yahoo and Google that contain your terms. You can then limit your search to just one file type by clicking on one of the four logos at the top of the list.

    DocJax

    I have only one minor quibble with DocJax, which is that it does not deduplicate the results. Other than that, it is an excellent tool for filetype searching. Many thanks to Peter Guillaume for alerting me to the service.

    If you prefer to search Yahoo and Google separately, then try Browsys Advanced Finder. Select Files form the menu at the top of the screen, enter your search terms and click on Yahoo or Google for your preferred file type. There is no need to re-enter your search terms for each search – just click your way through the list.

    BrowsysFiles

    I usually berate such services for not including Bing (formerly Microsoft Live Search) in their lists because Bing does sometimes come up with unique content. Although not included in Bing’s advanced search options one used to be able to simply incorporate the filetype: command followed by the file extension in the search. On testing it today, though, I discovered that the filetype command no longer works in Bing. Like the link and linkdomain commands, it has been obliterated from their search system. Another example of Bing dumbing down their search. This does not bode well for Yahoo: as part of the recent Microsoft deal, Microsoft will power Yahoo search and as a result Yahoo will lose many of its current search features. I’m afraid that rather than stealing market share from Google, Bing’s current approach to search will encourage users to stay with the big G.

    Free-to-use images might not be

    You may have already read that Google now includes a creative commons license filter option in its Advanced Image search screen. Creative Commons is a series of licenses that can be applied to a variety of works such as images, video and PowerPoint presentations and they specify what you can and cannot do with those works. Information on the licenses can be found on the Creative Commons web site at http://www.creativecommons.org/.

    Google does not use the CC terminology but has instead generated a pull down menu with the options: labelled for reuse, labelled for commercial reuse, labelled for reuse for modification,  and labelled for commercial reuse for modification.

    GoogleCCImages

    There is another option at the top of the list that is the default: not filtered by license. I had to think twice about this one because my first thoughts were that this was for public domain images. It is not. The “not filtered” option is all images. I ran the license options past a few people over the past week and they all immediately assumed that the default option is for images that you can use as you want.  A couple, though, then asked how “labelled for reuse” differed from this and then they became totally confused by the whole thing. To make it worse,  the licenses as listed by Google do not cover all the possible CC license conditions, for example attribution and share alike. So once you have done your search you still have to check the full license for the image that you wish to use. Furthermore, very few people are aware that you have to cite the license and any attribution as requested by the author.

    Google says in its help files:

    “By returning these search results, Google isn’t making any representation that the linked content is actually or lawfully offered under a Creative Commons license. It’s up to you to verify the terms under which the content is made available and to make your own assessment as to whether these terms are lawfully applied to the content.”

    The accuracy and validity of the Google implied license was raised recently in The Register: The tragedy of the Creative Commons . It comments:

    “Since there’s no guarantee that the licence really allows you to use the photo as claimed, then the publisher (amateur or professional) must still perform the due diligence they had to anyway. So it’s safer (and quicker) not to use it at all.”

    I disagree with that: I recommend using it as a first level filter but then check with the original web site regarding the details of the license. At least you won’t be spending hours wading through “all rights reserved” images.

    If you do use the license filter you will notice that many of the photos come from Flickr, which is owned by Yahoo!. Yahoo! has had a Creative Commons filter on its Image Advanced Search screen for a long time but only on the US site, not the UK. A far better way of searching CC Yahoo images is to go straight into Flickr at http://www.flickr.com/creativecommons/.  This gives you a description of the different licenses and you can search images assigned that license. This assumes, of course, that the person who has uploaded the image is the owner of that image and there are stories that this is not always so. But how paranoid do you have to be? With respect to Flickr my approach is to take the photographer’s word for it unless there are serious inconsistencies in their photostream, for example the  meta data associated with the photos suggests that they were in Armenia, New Zealand and Peru on the same day!

    So where do you go for images that really are free to use.  There is a trick you can use in Google  to pull up just public domain images. Carry out your search on the standard Image search screen and when the results come up add

    &as_rights=cc_publicdomain

    to the end of the string in your br0wser address bar, and press enter. (Thanks to Barry Schwartz at Search Engine Land for this tip) . The test searches I have tried so far come up with photos from NASA, US government sites and Wikimedia Commons.  NASA is a safe bet for public domain images as are US government web pages, although there are a few exceptions but these are clearly labelled with any copyright restrictions.. A recent spat between Wikimedia Commons and the UK’s National Portrait Gallery  – National Portrait Gallery bitchslaps Wikipedia: Hands off our photos! – has thrown suspicion on the validity of CC and public domain licenses attached to its photos. This appears to have been an isolated incident, though, and the high resolution images have now been removed if you are accessing the site from the UK.

    Another source of public domain images is MorgueFile, which is a small database of high resolution photos but you may have to play around with your search terms before you find exactly what you want.

    If you are looking for photos of buildings or locations in the UK then head straight for Geograph.  This aims to collect geographically representative photographs and information for every square kilometre of Great Britain and Ireland. Anyone can upload photos provided that they adhere to the guidelines and attach a Creative Commons Attribution-Share Alike 2.0 Generic license.

    Geograph has saved me so much time. A few months ago I was trying to find a photo of the Great Expectations pub in Reading, Berkshire. Google, Yahoo and Live (now Bing) insisted on giving me photos of people reading a copy of  Charles Dickins’s Great Expectations  while sitting in a pub in Berkshire. The image I wanted was probably somewhere in the list but I was not prepared to trawl through hundreds of results to find it. I typed in Great Expectations Reading into Geograph and I was there in a couple of seconds. Brilliant!

    GeographGreatExpectations

    If you are interested in finding out more about finding and using images head for JISC Digital Media – Still Images.

    Ixquick becomes Startpage

    Ixquick has changed its name to Startpage. The name and URL may have changed but the technology and way you use it is the same. Ixquick/Startpage is what is sometimes called a meta search engine, which I find misleading. It doesn’t search web page or document meta-data but takes your search and runs it across several tools at once and generates a single list of results. For the Web option it covers All the Web, Ask, Bing, Cuil, EntireWeb, Exalead, Gigablast, Lycos, Open Directory, Qkport, Wikipedia and Yahoo. All are automatically searched by Startpage but you can de-select individual services if you wish. Google is conspicuous by its absence.

    The great thing about tools of this kind is that it presents you with a single list gleaned from all of the search tools in the list. The disadvantage is that you cannot always use the advanced features of the individual services, although Startpage does have a go at addressing this. On the home page the Power Search option takes you to a page where there are boxes for “at least one of the words” (OR), “all the words”, “exact phrase” and “without the words”. There is also an expert search link that adds options for text in the title or URL, “at this domain name” (site search), pages “with links to this domain name”, and type of domain name (but only .com, .edu, .gov, .net, .org). Note that there is no filetype search. You can bring this screen up by default when you connect to Startpage by going to My Settings and selecting Expert Search as Homepage search mode.

    The site search seems to work well but the link option returns just a fraction of the true results and only displays the top 10. To find pages that link to a domain or specific page, searching direct on Yahoo or Exalead gives far better results.

    As well as Web Startpage does video search, which is powered by Blinkx.com, and allows you to restrict your search to categories such as business, news, sport and entertainment. The Pictures (image) option is disappointing and there is no information on which search engines are used to compile the results. The phone number option varies depending on the country you select but again there does not appear to any information on which telephone directories are used. Oddly, there is a “reverse” option to find a name from a telephone number even if you select the UK as a country. When you click on search, though, Startpage says the combination is not valid. Hardly surprising since reverse lookup is not available for public use in the UK.

    I don’t generally use tools such as Ixquick/Startpage or Dogpile. If I want to search quickly across multiple search engines I prefer to use something like Zuula or Browsys Advanced Finder and click through the tabs for each tool one by one. There are many people who do use them, though, and Ixquick has often been named as a favourite by people on my Advanced Search workshops. It is all down to what suits you and what works for your type of searches.  From the feedback that I have had over the years Startpage is definitely worth trying.

    Bing UK Round Table Meeting

    Hashtag: #meetbing

    Twelve bloggers, including myself and Phil Bradley, were invited to the round table meeting with Microsoft Bing in London on the evening of June 29th. The aim of the event, according to the email correspondence that preceded it, was to outline Microsoft’s plans for Bing in the UK and to obtain feedback and opinions from us. It wasn’t so much ’round table’ as around several tables. Laptops, netbooks and iphones came out at the start of the meeting and we all jostled for positions near power sockets. We were actively encouraged to tweet and blog before, during and after the event to our respective communities in order to spread the word and to elicit feedback from colleagues and friends. And we were provided with free wi-fi. This was definitely a step in the right direction and very different from similar search engine events that I have attended where you do not get through the door unless you have signed a non-disclosure agreement in blood, and you end up in a room that turns out to be a Faraday cage. So full marks to Microsoft on this aspect of the meeting.

    The plan was to have a presentation followed by a break, then another session with, I think, questions and feedback throughout. It started off according to timetable with an outline of what Microsoft is doing with Bing. As many UK searchers had already noticed the US version of Bing is very different from the one we have in the UK, which is still in beta. Microsoft wanted to “get traction” in the US first before developing the other country versions further. For example, they hope to be rolling out spell checking of queries in a few weeks.

    They then produced some statistics, one of which stated that only 1 in 4 searches delivers a successful result. I questioned where this data had come from and was told it had been collected from MSN search and the toolbar in IE. This then raised the question of how valid their data really is. It was soon after this that the programme fell apart as the questions started to flow fast and furious.

    One question was about the size of the Bing index. The Microsoft people dod not seem too sure about this but came up with a figure of approximately 10 billion pages. This surprised me as there have been several occasions when Live, Bing’s predecessor, has come up with unique pages that were not in either Google’s or Yahoo’s index. The index may be smaller, they said, but they are concentrating on quality – although no clues as to how they are doing that – and using humans and neural networks for “training” the ranking algorithms. They believe they will provide search results as relevant to Google in the UK in a few months. One piece of jargon that went completely over my head was tweeted by @leggetter: Microsoft are using NDCG to determine their search result relevance. If like me you haven’t a clue what that is, take a look at http://en.wikipedia.org/wiki/NDCG. In practice, people don’t care about the technology behind the results they just want relevant stuff on that first page and that is what is going to convince people that Bing is worth using.

    Microsoft confirmed that they are concentrating on the consumer market, which is obvious from the “verticals” that they are promoting in the US, for example suggesting hotels, comparing airline flight prices, comparing prices for various products. When I asked if they were going to provide a proper advanced search screen, they repeated their mantra of “concentrating on the consumer market” the implication being that “consumers” don’t need advanced search. I beg to differ. Google and Yahoo’s main audience is the consumer market. That is where they make  their money, but they still have a decent advanced search screen. Yes, you can do advanced search on Bing such as restricting to filetype but you have to know the commands. Most people acquire their searching preferences and skills at school, university, work or in the public library. If you get stuck you ask a colleague, the librarian or someone in the information centre what to do next. They may suggest a different search tool or show you how to use the advanced search screen boxes on Google or Yahoo. Filling in boxes is far easier for many people than having to remember and type in command line prefixes. And what people learn and find useful at school or in the workplace they continue to use when they search at home.

    It was around this point in the discussion that Phil Bradley said  “Nothing new here, nothing that excites me, nothing that is innovative”. And I have to agree with him. UK Bing at present is simply a rebadged version of Live.com. The Microsoft people admitted that it is not as good as it  should be but affirmed their commitment to its development long term. We shall have to wait and see. The UK team obviously passionately believes in the product but Microsoft does not have a good track record when it comes to following through on new initiatives. Academic Live was far superior to Google Scholar and its Live Books had content and advanced search features that Google Books did not. Both services were axed in May 2008.

    And finally there is the issue of censorship. Phil Bradley has repeatedly raised this with Bing directly, via his blog and on Twitter but so far has had no satisfactory response, and the Bing people at the meeting did not seem to know what is going on. For background on this see Phil’s blog postings The Microsoft Bing MeetingBing: excluding results from UK version? and More on Bing removing UK content.

    Overall, I am not sure what to make of the meeting. I suspect that Bing were expecting to be able do a straightforward sales pitch with a few easy questions from a tame audience, which we most definitely were not! I must congratulate the Bing people, though, for the cool way in which they handled the meeting. There was a lot of scribbling of notes on their side and promises to look into our concerns and questions. I had received several questions and comments from friends and colleagues but there was not enough time to raise them at the meeting itself.  Apart from the ridiculously short amount of time available to us  – we had two and a half hours  – there was a mixture of web developers, web 2.0 gurus and serious web searchers (Phil Bradley and myself) at the event and most of us had long lists of questions. I would definitely attend a second meeting but it would be more productive if they had separate events for developers and searchers.

    So would I recommend Bing as a search tool? Yes, but purely because I have always included Live.com in my list of essential search engines and Live now redirects to Bing. Bing is different from Live in the way it handles results but for some of my business searches it still sometimes pulls up unique results. (I will look at Bing search results in more detail in a separate posting). Is it a Google beater? I would love to see Bing give Google a run for its money but I can’t see it happening at present.

    Other blog postings on the meeting include The Microsoft Bing Meeting, Phil Leggetter – Microsoft Bing.com round table thoughts, Bing Roundtable: Where was the innovation? : David Stuart, Web Reflection: UK Bing Roundtable – Just My Opinion. Apologies if I have missed anyone.

    Meeting with Microsoft Bing

    I am attending a round table discussion with Microsoft on Monday 29th June to discuss their new search tool Bing. Phil Bradley will be attending as will other Web 2.0 and Internet commentators. If you have any comments or feedback on Bing, or would like me to ask a question on your behalf, do please let me know.

    We are being encouraged to blog, tweet etc the event so set your twitter search, alerting services or whatever to monitor the tag #meetbing