Full-Text Search Engine using MySQL
Keywords:
full-text, search, MySQL, index, search engine, ranking, metaphone, LevensteinAbstract
In this article we will try to explain how we can create a search engine using the powerful MySQL full-text search. The ever increasing demands of the web requires cheap and elaborate search options. One of the most important issues for a search engine is to have the capacity to order its results set as relevance and provide the user with suggestions in the case of a spelling mistake or a small result set. In order to fulfill this request we thought about using the powerful MySQL full-text search. This option is suitable for small to medium scale websites. In order to provide sound like capabilities, a second table containing a bag of words from the main table together with the corresponding metaphone is created. When a suggestion is needed, this table is interrogated for the metaphone of the searched word and the result set is computed resulting a suggestion.References
Fine-Tuning MySQL Full-Text Search - http://dev.mysql.com/doc/refman/5.0/en/fulltextfine- tuning.html
Full text search - by Wikipedia - http://en.wikipedia.org/wiki/Full_text_search
MySQL's Full-Text Formulas - by Database Journals - http://www.databasejournal.com/ features/mysql/article.php/3512461/MySQLs-Full-Text-Formulas.htm
Metaphone - by Wikipedia - http://en.wikipedia.org/wiki/Metaphone
The Levenstein distance - http://us2.php.net/levenshtein
Atlam, E.-S., Ghada, E.-M., Fuketa, M., Morita, K., Aoe, J., A compact memory space of dynamic full-text search using Bi-gram index, Computers and Communications, 2004. Proceedings ISCC 2004. Ninth International Symposium
Ikeda, T., Mano, H., Itoh, H., Takegawa, H., Hiraoka, T., Horibe, S., Ogawa, Y., "TRMeister: a DBMS with high-performance full-text search functions", Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference http://dx.doi.org/10.1109/icde.2005.148
Misutka, J., Galambos, L., "Mathematical Extension of Full Text Search Engine Indexer", Information and Communication Technologies: From Theory to Applications, 2008. ICTTA 2008. 3rd International Conference http://dx.doi.org/10.1109/ictta.2008.4530006
D. Zmaranda, G. Gabor, Issues on Optimality Criteria Applied in Real-Time Scheduling, International Journal of Computers Communications & Control, ISSN 1841-9836, Suppl.S, 3(S):536-540, 2008
Published
Issue
Section
License
ONLINE OPEN ACCES: Acces to full text of each article and each issue are allowed for free in respect of Attribution-NonCommercial 4.0 International (CC BY-NC 4.0.
You are free to:
-Share: copy and redistribute the material in any medium or format;
-Adapt: remix, transform, and build upon the material.
The licensor cannot revoke these freedoms as long as you follow the license terms.
DISCLAIMER: The author(s) of each article appearing in International Journal of Computers Communications & Control is/are solely responsible for the content thereof; the publication of an article shall not constitute or be deemed to constitute any representation by the Editors or Agora University Press that the data presented therein are original, correct or sufficient to support the conclusions reached or that the experiment design or methodology is adequate.