Flair: How to use Flair for segmenting text

Created on 16 Feb 2019  路  10Comments  路  Source: flairNLP/flair

Hi, I'd like to use Flair to experiment with putting paragraphs into text similar to http://aclweb.org/anthology/N18-2075 (I have some other segmentation problems to try but trying to begin with one where I can generate a lot of data with.)

However I'm unsure if the training format Flair needs supports this. I don't have to use the format used in the paper - I could for example produce a data file with one paragraph per line if necessary.

Any pointers would be appreciated.

wontfix

All 10 comments

You can train Flair's NER model for segmenting text and it works excellently well. Doing this will get you results similar to https://github.com/bedapudi6788/deepsegment/

Thanks @bedapudi6788 this looks perfect. I didn't realise I could do it with a NER model but I will go try it this week!

@emrul great, we'd be interested to hear how well it works!

@alanakbik , I have a trained Flair model for english that does segmentation for punctuated and unpunctuated text excellently. If anyone is interested, I can provide a link for the checkpoints.

Oh that is great - we'd appreciate if you could share the models!

@alanakbik

I can provide polish model developed with the same method if there is a need for that.

@krzynio we'd appreciated it if you shared it!

I鈥檇 appreciate a link to these also - starting on my model work this week

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

jewl123 picture jewl123  路  3Comments

Aditya715 picture Aditya715  路  3Comments

ChessMateK picture ChessMateK  路  3Comments

davidsbatista picture davidsbatista  路  3Comments

jannenev picture jannenev  路  3Comments