Products & Services
Data Extraction and Web Scraping APIs
We are experts in getting structured information from the web and we offer public APIs to instantly extract data in JSON format from popular websites such as indeed.com, monster.com etc. All of our APIs come with a free plan which lets you test it before committing to a paid plan.
Search Engine Scraper APIs
- Bing Scraper API: scrape search results, also known as search engine results pages (SERPS), from Bing.com search engine using an API.
- Google Scraper API: scrape search results, also known as search engine results pages (SERPS), from Google.com search engine using an API.
- Yellowpages Scraper API: scrape search results from yellowpages.com search engine using an API.
Jobs Scraper APIs
- Indeed Scraper API: scrape job listings from indeed.com using an API.
- Monster Scraper API: scrape job listings from monster.com using an API.
- Glassdoor Scraper API: scrape job listings from glassdoor.com using an API.
Scrape Store Locations API: This lets you scrape store locations of retailers and other businesses such as Walmart, Publix, Chick Fil A, Domino’s Pizza etc. using an API.
Craigslist Scraper API: Scrape real estate posts from craigslist using an API
You can hire us for our managed web scraping plan if the website domain you wish to scrape does not have a publicly available API.
We use big data tools to continuously crawl the Internet and aggregate useful domain level data described below.
News API: Access Specrom’s media monitoring and news database which covers latest news (less than 24h) over 20000 news sources and over a million journalists, bloggers and influencers.
Email Database API: Access Specrom’s email database of over 200 million email addresses
Domain Authority and Ranking: Specrom Domain Ranking and Authority API provides a numerical rank and a score from 0-100 for 66 million domains to quantify relative domain authority compared to other website domains.
Text Analytics APIs
We have a wide range of cloud based text analytics APIs which can used as a backend in your workflows to analyze data obtained by our web extraction and other Data APIs.
We have a full range of social listening tools that enable you to track keywords across news websites, blogs, forums, social media sites such as Twitter, Reddit and customer reviews on ecommerce and other platforms such as Yelp, Trustpilot, Amazon, Walmart.com, etc.
Also check out our free wordcloud generators:
Specrom Alerts for Social Listening and Media Monitoring
Specrom alerts lets you track keywords mentions over one million news outlets, blogs, forums and social media sites on a real time basis. Some examples of keywords are your name, brand, competitors, events, products or product categories.
You can set the frequency of alerts, and get a report sent directly to your inbox. There are also advanced filtering options to view only the keyword mentions that are most relevant to you and your business along with ability to set negative filters which can filter out websites if they are not relevant for you.
We have historical data going back till 2015 so you can not only spot the trends before they’re trending but also see how trends peak and fall.
Why are we better than Google alerts?
You may already be using Google alerts for your media monitoring; let us tell you why Specrom Alerts are better than plain Google Alerts
|Specrom Alerts||Google Alerts|
|Coverage||Blogs, Forums, websites and social||Blogs, Forums and websites|
|Filters||Want international major outlets only? Your choice with easy filter options for types of result sources||No option to filter result source|
|Wordcloud and advanced graphics||Y||N|
|Historical Data||Till 2015||N|
How Specrom Alerts helps businesses?
Compare your marketing efforts against the competition by tracking the best-performing keywords, trends and themes which can help you predict your competitors’ next moves.
Increase awareness about how audience is perceiving your business and discover brand influencer, bloggers, and reporters to help spread your message.
Get a high level overview of the competitive landscape so that you can flag opportunities and anticipate future threats to stay ahead of the competition.
Managed Web Scraping
We offer fully managed web scraping service for all your use cases and needs. We run web crawlers which index over 100 TB a month of webpages and our sophisticated natural language processing algorithms extract structured information out of it for applications in price monitoring, lead generation, technology trends analysis, alternative financial analysis among numerous other use cases.
Data Analytics Consulting Services
Why you should hire us?
We offer full range of services in data science and analytics using state of the art machine learning and artificial intelligence algorithms. All our data scientists have M.S. and phD degrees from top American universities and have published in top journals and conferences. We are proud to have worked with wide range of clients, from small businesses to fortune 500 companies and US federal agencies. Contact us now for an no obligation free consulation with our expert data scientists.
Evaluating off the shelf analytics products:
In last 3-4 years, there has been an explosion of cloud based machine learning powered APIs for image and text analysis (over 30 providers) as well as pretrained/default models such as H2o.ai apart from traditional desktop products such as KNIME, RapidMiner, SPSS etc.
All of these were trained on different types of datasets, and some may work quite well for your needs. Instead of letting your in-house team identify the best one by trial and error and spending both time and money, rely on our experience and let us help you find the best option quickly.
We can help you scope your requirements and based on your end use, we can quantitatively identify the best off the shelf products within your set budget. Even though we also sell text analytics APIs, in case our competitor APIs work better in your case than we will not hesitate to recommend it.
Data Annotation and Model Development:
Publicly available text analytics and image recognition AI models are usually trained on corpuses such as wikipedia, news articles etc. and will have very high predictive ability only if that data is somewhat similar to what you use in your business area.
Quite frequently, documents from domain areas such as finance, law, healthcare and pharma industry perform poorly on pretrained models. This could be quickly fixed by hiring us to label/annotate a dataset representative of your use case. We will use it to custom models using state of the art approaches and integrate that with Specrom Text Analytics Platform.
Data Analytics Integrations:
Advanced machine learning models are useless if not integrated tightly with your business’s existing workflow. We have worked on wide type of integrations such as:
- Analyze any text data stored in columns in microsoft excel, libreoffice and google sheets.
- Integrating with apps such as dropbox and slack through zapier so that whenever a team member of our client uploaded into a dropbox, it triggered our text analytics APIs and results were pushed to a dedicated slack channel.
- Integrating our APIs into Elastic or within data processing pipelines in Knime.
Education and Training
We offer training workshops of varying durations (10-100 hours instructions) in specific focus areas:
Machine Learning: Classification and regression (linear, logistic, support vector machine, random forest), deep learning (convolutional neural network (CNN/ConvNet), recurrent neural networks (simple RNNs, LSTMs), cluster analysis, feature engineering. Analyzing unstructured data using Natural-language processing (content and knowledge based recommender systems).
Statistical Methods: hypothesis testing (ANOVA, t-test) and confidence intervals, correlation (bivariate, partial, distances), time series, principal component analysis and dimensionality reduction.
Software toolkits: Python (scikit-learn, keras, pandas, matplotlib, numpy, scipy, NLTK, spaCy), R (shiny, knitr, ggplot2, tidyverse, caret), SQL (MySQL, SQLite, MongoDB), Apache Spark (MLLib), Apache Hadoop, Weka, Eclipse RCP/Java, KNIME.