Latent Semantic Indexing (LSI).
Latent Semantic Indexing (LSI) is a natural language processing technique that analyzes the relationships between a set of documents and the terms they contain. It starts from the term-document matrix, which records the occurrences of each term in each document.
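To make the term-document matrix concrete, here is a minimal sketch in Python. The three-document corpus and the variable names (`docs`, `vocab`, `term_doc`) are made up for illustration, and the layout assumes one row per term and one column per document, with raw counts as entries. Real systems usually add tokenization, stop-word removal and a weighting scheme on top of this.

```python
from collections import Counter

# Hypothetical toy corpus; the documents are illustrative only.
docs = [
    "cats chase mice",
    "dogs chase cats",
    "stocks and bonds",
]

# A sorted vocabulary gives each term a stable row index.
vocab = sorted({word for doc in docs for word in doc.split()})

# counts[j] maps each term to its frequency in document j.
counts = [Counter(doc.split()) for doc in docs]

# Rows are terms, columns are documents; each entry is a raw count.
term_doc = [[c[term] for c in counts] for term in vocab]

for term, row in zip(vocab, term_doc):
    print(f"{term:>7} {row}")
```

Most entries are zero even in this tiny example, which is why the matrix is normally stored in a sparse format.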
A little more mathematically, the term-document matrix is a sparse matrix whose rows correspond to terms and whose columns correspond to documents. LSI transforms this occurrence matrix into a relation between the terms and a set of concepts, and these concepts are in turn related to the documents. Documents can then be compared in the concept space.
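The post does not say how the transformation is computed. LSI is commonly implemented with a truncated singular value decomposition (SVD), so the sketch below assumes that approach and uses NumPy. The small count matrix, the choice of `k = 2` concepts and the helper `cosine` are all hypothetical, chosen only to show terms mapped to concepts, concepts mapped to documents, and documents compared in the concept space.

```python
import numpy as np

# Hypothetical term-document counts: rows are terms, columns are documents.
# Documents 0 and 1 are about animals, document 2 is about finance.
A = np.array([
    [1, 1, 0],  # cats
    [1, 1, 0],  # chase
    [0, 1, 0],  # dogs
    [1, 0, 0],  # mice
    [0, 0, 1],  # stocks
    [0, 0, 1],  # bonds
], dtype=float)

# Thin SVD: A = U @ diag(s) @ Vt, singular values in descending order.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

k = 2  # number of concepts to keep (an assumed choice)
term_concepts = U[:, :k]                    # relates terms to concepts
doc_concepts = (np.diag(s[:k]) @ Vt[:k]).T  # relates concepts to documents, one row per document

def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

sim_animals = cosine(doc_concepts[0], doc_concepts[1])
sim_mixed = cosine(doc_concepts[0], doc_concepts[2])
print(f"animal vs animal:  {sim_animals:.3f}")
print(f"animal vs finance: {sim_mixed:.3f}")
```

In this toy data the two animal documents share only two of their three terms, yet they land on the same direction in the two-concept space, while the finance document stays orthogonal to both. That is the effect LSI is after: documents are matched on shared concepts rather than on exact term overlap.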













