Questions tagged [search-engine]

49 questions
16
votes
1 answer

How is machine learning incorporated into search engine design?

I am currently building a small in-house search engine based on Apache Lucene. Its purpose is simple - based on some keywords, it will suggest some articles written internally within our company. I am using a fairly standard TF-IDF scoring as a base…
15
votes
5 answers

How to search for information related to Go programming language?

If there's something that would kill the Go language, then it's its name. I find it very hard to find anything related on the web just by using "Go" in the query. The word is one of the most frequent in English. So I see everything in the results except…
Maksee
9
votes
4 answers

If I wanted to build a search engine, how would I start?

Can somebody point me in the right direction? I really don't have experience writing code, but I'm a fast learner. I'm not trying to say it will be easy, but I hope somebody can help me...
Ravi Sheth
8
votes
2 answers

How does Elastic Search approach the problem of distributed membership and consensus?

I already understand that Elastic Search is supposed to be deployed in a distributed topology, in that you can have multiple nodes for a cluster of ES instances. I like the API, and it looks promising. But I wonder how Elastic Search solves some…
8
votes
1 answer

Good technique for search text tokenization

We are looking for a way to tokenize some text in the same or a similar way as a search engine would. The reason we are doing this is so that we can run some statistical analysis on the tokens. The language we are using is Python, so would…
Chris Dutrow
8
votes
3 answers

Looking for good books about the theory behind search engines

I am working on a project that requires that I understand different techniques used by search engines for the web. I have a strong scientific and development background, so I am not afraid of highly technical information. I am looking for all forms…
sebpiq
7
votes
4 answers

Are there more efficient ways to search for help/documentation than Google/SO/...?

Google sometimes comes up with irrelevant links, not everything is available on SO, and there are cases where the local documentation is also annoying to look through... Are there other efficient ways you use to search?
Tamara Wijsman
6
votes
2 answers

Client-Side vs. Server-Side Searching?

I am currently helping to design a web site and application in HTML. We would like the user to be able to search the site/app for desired content via a search bar. We would also like to include an advanced search ability to allow for different…
6
votes
2 answers

Data indexing frameworks fit for large E-Commerce applications

We wrote and still maintain a large E-Commerce application. Our feature list resembles what you would expect from most shops. We'd like to improve some of our features, and now the search/suggestion list functionality (enter some letters, a…
Dabu
5
votes
1 answer

Is having a single `IndexWriter` instance in Lucene a good idea?

I am trying to understand how Lucene should be used. From what I have read, creating an IndexReader is costly, so using a Search Manager should be the right choice. However, a SearchManager should be produced by a NRTManager (which, by the way,…
Dragos
5
votes
1 answer

Search multiple tables

I have developed a web application that is used mainly for archiving all sorts of textual material (documents, references to articles, books, magazines etc.). There can be any given number of archive tables in my system, each with its own schema.…
kgilden
4
votes
1 answer

Which is the best method for storing data for a Language Dictionary?

I have been working on making a completely offline dictionary using the Wiktionary XML dumps. The dumps themselves are about 10 MB, but when converted into an index using a search engine indexer (I use Whoosh Search Engine in Python), the complete…
user22662
4
votes
1 answer

Which is the better search system: user-defined tags or predefined attribute fields?

I'm developing a marketplace application that will eventually list a large number of different types of items. Currently our search system (powered by Algolia) is built on predefined attribute fields that must be filled out by the user when adding…
rt_
4
votes
2 answers

Small Search Engine Algorithm for Document Word Search

I have to design and implement an algorithm for my university project that searches a given set of documents based on the given keywords/query. Assume that each document contains a few sentences and these documents can be stored in a suitable data…
4
votes
9 answers

Is there a useful correlation between choice of web-browser/search engine and good quality programmers?

After a round of interviews earlier this year, which included some practical questions with access to a computer, I noticed that, at least for the applicants we were seeing, there was a high inverse correlation between quality of applicant and the…
blueberryfields