Preparing your result...
Loading...
Press Esc to dismiss this message

Method And System For Adaptively Weighting Full Text Search Results Of Structured Data (17-Dec-2009)

Thumbnail
IP.com Prior Art Database Disclosure (Source: IPCOM)
Disclosure Number IPCOM000191118D dated 17-Dec-2009
Originally published in Prior Art Database
Disclosed by: IBM
Country: Undisclosed
Disclosure File: 2 pages / 36.7 KB / English (United States)

A general mechanism to analyze the structure of a document or record, relevant user interfaces for data entry, and historical use, to adaptively infer the importance of particular fields. Once field importance is assessed, full text search hits can be weighted for higher quality search results.

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 53% of the total text.

Page 1 of 2

Method And System For Adaptively Weighting Full Text Search Results Of Structured Data

Many full text search algorithms exist for searching unstructured data-like web pages. They utilize clever techniques beyond just word frequencies to deliver accurate results. When applied to structured data, however, search engines do not adaptively weight search hits based on the importance of a field in a structured document or database. Some allow you to explicitly set the importance of a field to weight search results, but this is a manual process. This also requires knowledge of the document or database schema and must be updated manually as the schema changes.

This technique can be applied to any structured data, regardless of the actual document or database structure, and field importance assessments can be continuously re-evaluated and updated as the document or database structure changes.

This analysis of structured data can be broken down into three broad categories:

1. Field Definition: How a field is defined in a database schema, including field length and field type

2. Field Presentation: How fields appear in a form as part of an application's user interface
3. Historical Use: Analysis of existing data to determine how a field is used

Field Definition

Simply looking at field metadata reveals clues about how important a particular field might be. For instance, search hits for fields that have shorter maximum character lengths should be weighted higher than multi-line fields with longer character lengths. In a change management system, a defect's "summary" or "product" field values (fewer characters) might be weighted higher than "description" or "comments" (more characters).

Fields of certain types might be weighted higher. For instance, a field that references another record could be weighted higher than a field that represents a date. System fields such as "DBI...

(Source: IPCOM)
First page image
(Source: IPCOM)