DigitalPebble Ltd is a consultancy and solution provider specialising in web crawling, natural language processing, machine learning and search.
We advise, evaluate and implement solutions based on leading open source software, such as StormCrawler, Apache Nutch, GATE or Elasticsearch. We aim to combine open source tools to provide efficient, reliable and low-cost made-to-order solutions.
Our unique expertise covers all aspects of documents life cycle, from web-wide crawling and collection, content analysis, filtering and categorization to indexing. We are specialised in large scale processing using Apache Hadoop or Apache Storm and have expertise in cloud platforms such as Amazon AWS, which has allowed us to successfully deploy solutions scaling up to billions of documents for our clients.
Not only to we have an extensive knowledge of open source software, we are also active contributors and provide some of the resources that we have developed over the years under open source licenses.
DigitalPebble's director, Julien Nioche, is a member of the Apache Software Foundation and a long standing committer to Apache Nutch. Julien is a contributor and committer on several other open source projects as well as a conference speaker.
Our clients range from startup in stealth mode to NASDAQ listed companies and operate in domains as varied as business intelligence, media monitoring, telecommunications or software development.