Skip to content
@structflo

StructFlo

StructFlo

Open-source tools for early drug discovery, chemistry, and the life sciences.

License: Apache 2.0 Python 3.9+ GitHub Org


Libraries

Zero-configuration Named Entity Recognition for drug discovery, chemistry, and biological sciences.

Extracts compounds, targets, genes, diseases, bioactivities, and assays from scientific text using two complementary engines:

  • NERExtractor — LLM-powered extraction via Gemini or local Ollama models, with contextual understanding for novel entities
  • FastNERExtractor — Dictionary-based extraction using YAML gazetteers with exact, regex, and fuzzy matching (~393ms per abstract)
from structflo.ner import NERExtractor

extractor = NERExtractor()
results = extractor.extract(abstract_text, profile="CHEMISTRY")

Chemical Structure Entity Recognition A Deep Learning detector for identifying compound panels and their matching labels/IDs in scientific documents.

Detects bounding boxes containing rendered chemical structures and their associated label IDs (e.g. CHEMBL12345) across PDFs and images and full-page documents.

from structflo.cser.pipeline import ChemPipeline

pipeline = ChemPipeline()
results = pipeline.run("paper.pdf")

License

Most structflo libraries are released under the Apache License 2.0. Please review individual repos for any exceptions.

Pinned Loading

  1. structflo-ner structflo-ner Public

    Zero-config Named Entity Recognition for drug discovery, chemistry, and biological sciences.

    Python 2 2

  2. structflo-cser structflo-cser Public

    Chemical structure-label pair extraction from scientific documents.

    Jupyter Notebook 2 2

Repositories

Showing 3 of 3 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…