sematext

Sematext implements Open-Source Search, Natural Language Processing, and Text Analytics technology in the enterprise.

We focus on the design and development of scalable, high-performance search solutions.

"We've worked closely with Sematext since day 1 of developing our Salesforce Content product. Their real-world expertise in designing and scaling Lucene and SOLR based solutions has proved invaluable."
-- Tim Barker, Sr. Director Product Management, Salesforce Content.
Recent projects:
  • Content-processing framework with Topic Classification, Named Entity Recognition, Sentiment Detection, and Key Phrase Extraction.
  • Custom Smart-Spellchecker functionality built on top of Solr's Spellchecker.
  • Solr master-slave cluster with distributed search, fail-over, and load balancing, running on EC2.
  • Advertising click and impression log mining and reporting, utilizing Hadoop on EC2.
  • Nutch and Solr-based country-wide search engine with cultural and linguistic awareness, utilizing Amazon's EC2 and S3.
  • Solr performance and scalability advice for a very high-traffic SaaS provider.
  • ...