chemu-ner-demo / README.md
mpkato's picture
Upload README.md with huggingface_hub
6974ba5 verified
|
Raw
History Blame Contribute Delete
1.14 kB

A newer version of the Gradio SDK is available: 6.19.0

Upgrade
metadata
title: ChEMU NER Demo
emoji: ⚗️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.49.1
python_version: '3.12'
app_file: app.py
pinned: false
license: cc-by-nc-3.0
short_description: BioBERT-based NER for chemical patents (ChEMU 2020)

ChEMU NER Demo

Interactive demo for a BioBERT-based named entity recognition model fine-tuned on the ChEMU 2020 Task 1 chemical patent corpus.

The model (mpkato/chemu-biobert-ner) extracts 10 reaction-step entity types from patent text:

  • Compounds: STARTING_MATERIAL, REAGENT_CATALYST, REACTION_PRODUCT, SOLVENT, OTHER_COMPOUND
  • Conditions: TEMPERATURE, TIME
  • Yields: YIELD_PERCENT, YIELD_OTHER
  • Labels: EXAMPLE_LABEL

Held-out dev exact-match micro-F1 ≈ 0.95.

Usage

Paste any reaction description into the text box and click Extract entities. Three quick examples are provided at the bottom.

License

Training data (ChEMU 2020) is licensed under CC BY-NC 3.0; this demo and the underlying model are therefore for non-commercial research use only.