Sentensa - a new open source search engine

Posted by VISITOR on Nov 24, 2005 10:20 AM EDT
www.sentensa.com; By Bo Lindstrom

In September, Virtual Genetics launched Sentensa, a new search engine that can be downloaded free of charge over the Internet as Open Source. Virtual Genetics is now launching the next version, Sentensa 2.2, which supports Java 5.0 and adds some new features.

Sentensa draws on 20 years of experience in search technology to create a next-generation search engine. Users gain entirely new ways of searching very large text databases. The name Sentensa was chosen to describe how the engine takes the underlying meaning of the search text into account to find the most relevant results. Searches become much faster and more efficient, which makes Sentensa applicable in many different markets, from the pharmaceutical industry to media and general document handling. With Sentensa, users can create advanced applications for searching all kinds of text, and these applications can serve many users at the same time.

"Sentensa is supplied as Open Source, which means that it is free of charge and may be downloaded over the Internet. The new version supports Java 5.0. The indexing method and the spider are much more efficient, and the new version also supports different languages. A number of new search methods are also included," says Bo Lindström, President, Virtual Genetics. "We are constantly developing Sentensa, for example with an API built on the popular SOAP protocol and with support for SQL databases. This will be released soon."



Technical Information

Sentensa 2.2 is a platform-independent, web browser-based search engine written entirely in Java 5.0. The system consists of a web browser interface that can handle many users, a spider that crawls the Internet, an indexing application and a search engine. Sentensa supports, among other features, probabilistic searching and ranking as well as traditional bibliographic access. Users have a number of new functions and help tools for constructing complex search arguments. The indexing technology is a hybrid of hash-based and sort-inversion methods, which has resulted in very efficient and compact indexing.
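The release does not describe how the hybrid index is actually built, so the following is only an illustrative sketch of one way hash-based and sort-inversion ideas can be combined: (term, record) pairs are collected and sorted, as in sort-based inversion, and the resulting postings lists are stored in a hash table for fast term lookup. The class and method names (`HybridIndexSketch`, `build`) and the simple tokenizer are assumptions made for this example and are not taken from Sentensa.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: not Sentensa's code, just a toy hash + sort-inversion index.
class HybridIndexSketch {

    /** A (term, record id) pair emitted while scanning the text. */
    static final class Posting implements Comparable<Posting> {
        final String term;
        final int docId;

        Posting(String term, int docId) {
            this.term = term;
            this.docId = docId;
        }

        @Override
        public int compareTo(Posting other) {
            int byTerm = term.compareTo(other.term);
            return byTerm != 0 ? byTerm : Integer.compare(docId, other.docId);
        }
    }

    /** Builds a map from each term to the sorted ids of the records containing it. */
    static Map<String, List<Integer>> build(List<String> docs) {
        // Step 1: emit one pair per token occurrence.
        List<Posting> pairs = new ArrayList<>();
        for (int id = 0; id < docs.size(); id++) {
            for (String token : docs.get(id).toLowerCase().split("[^a-z0-9]+")) {
                if (!token.isEmpty()) {
                    pairs.add(new Posting(token, id));
                }
            }
        }

        // Step 2 (sort inversion): sort pairs by term, then by record id.
        Collections.sort(pairs);

        // Step 3 (hashing): collapse the sorted run into postings lists keyed by term.
        Map<String, List<Integer>> index = new HashMap<>();
        for (Posting p : pairs) {
            List<Integer> postings = index.computeIfAbsent(p.term, t -> new ArrayList<>());
            // Pairs arrive sorted, so a duplicate can only be the last entry.
            if (postings.isEmpty() || postings.get(postings.size() - 1) != p.docId) {
                postings.add(p.docId);
            }
        }
        return index;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList("Search engine in Java", "Java search", "engine");
        System.out.println(build(docs).get("java"));
    }
}
```

Sorting before insertion means each postings list is built in record order without any per-list sorting, which is one reason sort-based inversion tends to give compact lists; a production index would also compress the lists and work from disk rather than memory.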

The web interface is based on a modern servlet architecture that provides safe and efficient access to the functions the system offers. Standard Java RMI is used for the client-server protocol. Sentensa requires that a servlet server is installed (for example Tomcat, which is also Open Source). Input data consists of XML files that contain the records to be indexed. The spider converts other formats, for example PDF files, to XML that can then be searched.
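As a rough illustration of XML input, the sketch below reads records from an XML string using the standard Java DOM parser. The release does not document Sentensa's record schema, so the `<records>` and `<record id="...">` element and attribute names, as well as the `XmlRecordReaderSketch` class, are hypothetical and chosen only for this example.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Illustrative only: the XML layout here is assumed, not Sentensa's real format.
class XmlRecordReaderSketch {

    /** Returns record id -> record text, in document order. */
    static Map<String, String> readRecords(String xml) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            // Hypothetical schema: every <record> carries an id attribute and plain text.
            NodeList nodes = doc.getElementsByTagName("record");
            Map<String, String> records = new LinkedHashMap<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element record = (Element) nodes.item(i);
                records.put(record.getAttribute("id"), record.getTextContent().trim());
            }
            return records;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse record XML", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<records>"
                + "<record id=\"a\">Hello world</record>"
                + "<record id=\"b\"> Second record </record>"
                + "</records>";
        System.out.println(readRecords(xml));
    }
}
```

The text returned for each record could then be passed to an indexing step; how Sentensa itself maps XML fields to searchable fields is not stated in the release.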

For further information, contact Bo Lindström, President, Virtual Genetics, tel. +46-8-752 60 90 or E-mail bo.lindstrom@vglab.com

Sentensa, which has been developed by Virtual Genetics, Asimus and Contactor Data, is available as Open Source at http://www.sentensa.com. Also visit: http://www.vglab.com

» Read more about: Story Type: Press Release; Groups: GNU
