
Change pre-trained model? #70

Open
ggnicolau opened this issue Mar 25, 2022 · 5 comments
Labels
documentation Improvements or additions to documentation

Comments

@ggnicolau

I'm trying to build a spell-checker proof of concept (POC) for an e-commerce search engine. We already use the Transformer architecture for other tasks, and I thought I would also try it for spell checking.

I came across this beautiful API and want to give it a try. I see it uses the classic pre-trained BERT model, but I need a Portuguese pre-trained model (such as BERTimbau) or a multilingual one (such as MiniLM).

It would be good if we could pass the desired pre-trained model as a parameter to the function.

I may be wrong and this may already be implemented; if so, correct me. Is there an easy way to choose my pre-trained model without going low-level?

@ggnicolau ggnicolau added the enhancement New feature or request label Mar 25, 2022
@stale

stale bot commented Apr 24, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.

@stale stale bot added the wontfix This will not be worked on label Apr 24, 2022
@R1j1t
Owner

R1j1t commented Apr 24, 2022

Hey @ggnicolau, apologies for the delayed response. The package supports passing custom models to the spaCy pipeline and/or the contextual spellchecker. Please refer to the snippet below:

import spacy
import contextualSpellCheck

nlp = spacy.load("ja_core_news_sm")
nlp.add_pipe(
    "contextual spellchecker",
    config={
        "model_name": "cl-tohoku/bert-base-japanese-whole-word-masking",
        "max_edit_dist": 2,
    },
)

If this does not solve the issue, please let me know and we can work from there! Furthermore, this issue suggests the documentation (README) should be updated with this use case. I will update the labels accordingly!
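For the original Portuguese use case, the same pattern should carry over with a Portuguese masked-language model. A minimal sketch, assuming the BERTimbau checkpoint name neuralmind/bert-base-portuguese-cased and the pt_core_news_sm spaCy model (both names are assumptions, not verified in this thread):

```python
# Sketch: adapting the snippet above to Portuguese.
# "neuralmind/bert-base-portuguese-cased" (BERTimbau) and
# "pt_core_news_sm" are assumed names, not confirmed here.
config = {
    "model_name": "neuralmind/bert-base-portuguese-cased",
    "max_edit_dist": 2,
}

try:
    import spacy
    import contextualSpellCheck  # noqa: F401 -- registers the pipe factory

    nlp = spacy.load("pt_core_news_sm")
    nlp.add_pipe("contextual spellchecker", config=config)
except Exception as err:  # packages/models not installed locally
    print(f"spaCy setup skipped: {err}")
```

Any Hugging Face masked-LM checkpoint that the package supports should slot into model_name the same way.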

@stale stale bot removed the wontfix This will not be worked on label Apr 24, 2022
@R1j1t R1j1t added documentation Improvements or additions to documentation and removed enhancement New feature or request labels Apr 26, 2022
@hardianlawi

I think the code does not work for all pre-trained models. I tried changing bert-base-cased to roberta-large, and it did not work for the example in the documentation.

@linhuixiao

Thank you very much. Due to regional internet access restrictions, the bert-base-cased model can't be downloaded automatically; this code solved my problem of loading the BERT model from local disk.
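For anyone else behind a restricted network: since model_name appears to be handed to Hugging Face's from_pretrained, which also accepts a local directory, pointing it at a pre-downloaded checkpoint should work. A hedged sketch (the path below is hypothetical, and the forwarding behavior is an assumption):

```python
import os

# Hypothetical local directory holding a pre-downloaded checkpoint
# (config.json, vocab files, and weights saved via save_pretrained).
local_model_dir = os.path.join("models", "bert-base-cased")

config = {
    "model_name": local_model_dir,  # assumption: forwarded to from_pretrained,
    "max_edit_dist": 2,             # which accepts local paths
}

try:
    import spacy
    import contextualSpellCheck  # noqa: F401 -- registers the pipe factory

    nlp = spacy.load("en_core_web_sm")
    nlp.add_pipe("contextual spellchecker", config=config)
except Exception as err:  # packages/models not installed locally
    print(f"spaCy setup skipped: {err}")
```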

@shoegazerstella

Hi, I am also trying the bert-base-multilingual-uncased model and it seems not to work.
I am aiming at a multi-language spell checker.
Any tips on that? Thank you!
