Connect to John Snow Labs
John Snow Labs provides production-grade, scalable, and trainable versions of the latest research in natural language processing (NLP) through the following products:
You can integrate your Databricks clusters with John Snow Labs.
John Snow Labs does not integrate with Databricks SQL warehouses (formerly Databricks SQL endpoints).
Connect to John Snow Labs using Partner Connect
The Partner Connect steps cover the most popular NLP and OCR tasks:
Differences between standard connections and John Snow Labs
To connect to John Snow Labs using Partner Connect , you follow the steps in Connect to ML partners using Partner Connect . The John Snow Labs connection is different from standard machine learning connections in the following ways:
Steps to connect
To connect your Databricks workspace to John Snow Labs using Partner Connect , see Connect to ML partners using Partner Connect .
Connect to John Snow Labs manually
Follow these instructions to automatically install the John Snow Labs NLP and OCR libraries and notebooks on your cluster, and to activate your trial of John Snow Labs if you do not already have a John Snow Labs account.
Requirements
Before you integrate with John Snow Labs, you must have the following:
A Databricks cluster in your Databricks workspace.
A Databricks personal access token .
As a security best practice when you authenticate with automated tools, systems, scripts, and apps, Databricks recommends that you use OAuth tokens .
If you use personal access token authentication, Databricks recommends using personal access tokens belonging to service principals instead of workspace users. To create tokens for service principals, see Manage tokens for a service principal .
Procedure
To integrate with John Snow Labs, complete these steps:
Make sure you meet the requirements for John Snow Labs.
Go to the John Snow Labs NLP on Databricks webpage.
Click Install in my Databricks account .
In the Please tell us about yourself dialog, enter your first name, last name, and company email address.
For
Databricks instance url
, enter your
Databricks
workspace URL
, for example
https://dbc-a1b2345c-cloud.databricks.com/?o=1234567890123456
.
For Databricks access token , enter your Databricks personal access token value from the requirements in this article.
Click Test connection .
After the connection succeeds, for Choose a cluster to install on , select the cluster from the requirements in this article.
Click Get Trial License .
Check your email inbox for a message from John Snow Labs that contains a request to validate your email address.
In the message, click Validate my email .
After several minutes, check your email inbox again for another message from John Snow Labs that contains instructions about how to get started. Note that in some cases it could take up to a half hour for this message to arrive.
Follow the instructions in the message.
To manually install the John Snow Labs libraries and notebooks on your cluster, see the following on the John Snow Labs website:
- Install Spark NLP on Databricks
- Install Spark NLP for Healthcare on Databricks
- Install Spark OCR on Databricks
- John Snow Labs website
- Spark NLP
- Spark NLP for Healthcare
- Spark OCR
- John Snow Labs NLP Documentation
- John Snow Labs NLP on Databricks
To upgrade your trial of John Snow Labs, sign in to your John Snow Labs account, at https://my.johnsnowlabs.com/login .
Continue to next steps.
Next steps
Explore one or more of the following resources on the John Snow Labs website: