Instructions to use christofid/dapscibert with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use christofid/dapscibert with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="christofid/dapscibert")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("christofid/dapscibert") model = AutoModelForMaskedLM.from_pretrained("christofid/dapscibert") - Notebooks
- Google Colab
- Kaggle
dapSciBERT
DapSciBERT is a BERT-like model trained based on the domain adaptive pretraining method (Gururangan et al.) for the patent domain. Allenai/scibert_scivocab_uncased is used as base for the training. The training dataset used consists of a corpus of 10,000,000 patent abstracts that have been filed between 1998-2020 in US and European patent offices as well as the World Intellectual Property Organization.
- Downloads last month
- 11