Posts Tagged ‘deep web’

DeepDyve Launches Publisher Tools

Wednesday, April 29th, 2009

Yesterday we announced several exciting new tools and products for the Publishing industry. As you know, we work closely with many publishers to make their expert content discoverable in our search engine. With the release of these tools, DeepDyve will now enable publishers to leverage the DeepDyve technology on their own sites. These tools are specifically designed for publishers whose content is already in the DeepDyve index and who want to bring custom search results (i.e. just their articles) to visitors on their site:
•    Publisher Landing Pages — Publishers can leverage DeepDyve’s technology to drive more traffic to their sites. DeepDyve will dynamically build landing pages for EVERY article in the collection. Each landing page is information rich with related article links and is optimized to be more easily found in the major search engines.

•    Search API — DeepDyve’s search technology can easily be added to a publisher’s site to make their content more findable. Through a web services Application Protocol Interface (API), simply ‘cut and paste’ the API code and allow DeepDyve to power the search.
•    More Like This Document API — This tool allows websites to directly interface with the DeepDyve database to search for articles that are related to any designated document within our index. The user simply clicks a MLT link or button and DeepDyve will return related article results using the designated document as the query (alternatively, the results can be streamed in automatically).
•    Content Highlight Widget — The Highlight Widget allows users at the publisher’s site to highlight any block of text up to 5,000 characters, then simply run that selection as a query.

“We are looking forward to DeepDyve powering the search on PLoS.org,” said Peter Jerram, CEO of The Public Library of Science.  “DeepDyve’s cutting-edge technology and ability to use entire sentences as a query will make it much easier for our users to find and discover new original research in science and medicine.”

Each of these tools is available for free with advertising revenue sharing, or for a fee which varies depending on volume. To learn more, please visit the Publisher section of our website.   You can also click here to read the entire press release.

###

About The Public Library of Science

The Public Library of Science (www.plos.org) is a nonprofit organization committed to making the world’s scientific and medical literature a freely available public resource. Its goals are to:
•    Open the doors to the world’s library of scientific knowledge by giving any scientist, physician, patient or student - anywhere in the world - unlimited access to the latest scientific research.
•    Facilitate research, informed medical practice and education by making it possible to freely search the full text of every published article to locate specific ideas, methods, experimental results and observations.
•    Enable scientists, librarians, publishers and entrepreneurs to develop innovative ways to explore and use the world’s treasury of scientific ideas and discoveries

New Scientific Research Available at DeepDyve

Monday, January 19th, 2009

One of the challenges of doing any sort of search on the Web is filtering out the noise.  In the case of doing research, the challenge is even larger as the information you are seeking is often very specific and authoritative.  A large focus for DeepDyve is in bringing these authoritative sources to you and we are pleased to announce the following content sources will be added to our index this month:

Welcome to DeepDyve

Thursday, January 15th, 2009

Greetings!  This is the inaugural post in our new blog that will be a forum for discussions about what’s on our mind – and yours – in the search and information industry.  The vision behind our company is that Search is in its infancy and today’s ‘traditional’ search engines meet only our most basic, albeit common, needs – the fat part of the long tail. However, doing research, looking for that needle in the long-tail haystack, requires a different approach.

In essence, good search of any kind comes down to the strength of the relevance algorithm, the quality and quantity of content to search against, and the ease of use of the product.  Whereas most search today is often narrow, ‘single-concept’ type queries (‘Obama’, ‘restaurant review’, etc.), research queries are inherently ‘multi-concept’ as the user wants to find broad and deep information about a complex subject (for example, starting with “pancreatic cancer treatment options, side effects, current research, clinical trials”; all the way to “effectiveness of selenium for pancreatic cancer”).

This has several implications:

  1. Relevance: there’s so much information on the Web that searching against all this data inevitably produces tremendous noise.  This problem is further exacerbated by search algorithms which use techniques such as PageRank which prioritize results based on popularity.  When you combine these factors, you end up with millions of hits where the top results are irrelevant and you have to sift for hours to find the hidden gems.
  2. Content: as mentioned above, the web is littered with content.  Unfortunately, for research purposes, the vast majority of content are news articles, amateurish and/or commercial.  Researchers want access to the best available information from authoritative sources ideally spanning general, introductory information all the way down to highly-specialized, peer-reviewed literature.
  3. Short/long queries: single-concept queries can easily be described in 3 words or less unlike multi-concept queries where researchers would like the flexibility of describing exactly what they are looking for in a natural language, or alternatively letting “the content be the query” by using a relevant article as an example for finding more articles related to that subject.  In addition, they would like to be able to search across multiple subject areas and not have to recreate their efforts from vertical to vertical.

Our mission at DeepDyve is to build a research engine that is focused on solving these challenges.  Our vision is that the paradigm of ‘research’ is evolving as more individuals take advantage of the incredible wealth of information potentially available to them not just from content sources but also from each other.  The challenge will be how to effectively make this information, and each other, easily accessible. We look forward to sharing our ideas with you on our vision and hearing your thoughts and suggestions as we embark on this journey.