Abbreviation-Expansion detection must detect the following cases #158

Open
nuwandavek opened this issue Nov 25, 2020 · 1 comment
Labels
nlp NLP backend modules

Comments

Contributor

nuwandavek commented Nov 25, 2020

While writing tests for abbreviation-expansion detection, I noticed a few cases that the system currently does not handle. We could add these to the regression tests once we swap out the spaCy model for a more advanced BERT-based model.

  • Case 1: Abbreviation not in parentheses
    text : "We use a Convolutional Neural Network, known as CNN, based architecture in this model, which is an improvement over state-of-the-art"
    gold : [(Abbreviation : 'CNN', Expansion : 'Convolutional Neural Network')]

  • Case 2: Expansion in parentheses
    text : "GANs (Generative Adversarial Networks) outperform most generative models in the novel human face generation task."
    gold : [(Abbreviation : 'GANs', Expansion : 'Generative Adversarial Networks')]
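
As a rough sketch of how these two cases could be captured as regression data, the snippet below stores each text with its gold pairs and reports which cases a given detector misses. The names `CASES`, `naive_detector`, and `failing_cases` are hypothetical and are not part of this project. The baseline only recognises the common "Long Form (ABBR)" pattern and stands in for whatever detector is eventually plugged in; it is not the scispacy or BERT-based model.

```python
import re
from typing import Callable, List, Tuple

Pair = Tuple[str, str]  # (abbreviation, expansion)

# The two cases from this issue, written as (text, gold) entries.
CASES: List[Tuple[str, List[Pair]]] = [
    (
        "We use a Convolutional Neural Network, known as CNN, based "
        "architecture in this model, which is an improvement over "
        "state-of-the-art",
        [("CNN", "Convolutional Neural Network")],
    ),
    (
        "GANs (Generative Adversarial Networks) outperform most generative "
        "models in the novel human face generation task.",
        [("GANs", "Generative Adversarial Networks")],
    ),
]


def naive_detector(text: str) -> List[Pair]:
    """Hypothetical baseline: only matches 'Long Form (ABBR)'."""
    pairs: List[Pair] = []
    # A single parenthesised token starting with a capital letter.
    for match in re.finditer(r"\(([A-Z][A-Za-z]{1,9})\)", text):
        abbr = match.group(1)
        capitals = "".join(ch for ch in abbr if ch.isupper())
        words = text[: match.start()].split()
        if len(words) < len(capitals):
            continue
        # Take one preceding word per capital letter in the abbreviation.
        candidate = words[-len(capitals):]
        # Accept only if the word initials spell out the capital letters.
        if "".join(w[0].upper() for w in candidate) == capitals:
            pairs.append((abbr, " ".join(candidate)))
    return pairs


def failing_cases(detector: Callable[[str], List[Pair]]) -> List[str]:
    """Return the texts whose detected pairs differ from the gold pairs."""
    return [
        text for text, gold in CASES
        if sorted(detector(text)) != sorted(gold)
    ]


if __name__ == "__main__":
    for text in failing_cases(naive_detector):
        print("MISSED:", text)
```

Passing a different detector to `failing_cases` would show whether it closes the gap on either case. The parenthesis-only baseline misses both, since Case 1 has no parentheses and Case 2 puts the expansion, not the abbreviation, inside them.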

nuwandavek added the nlp NLP backend modules label Nov 25, 2020
Contributor

dykang commented Nov 25, 2020

@nuwandavek Do you have any numbers on how the scispacy model and the neural model perform on the two cases you mentioned?
