Text Normalization & Inverse Text Normalization
-
Updated
Jun 5, 2024 - Python
Text Normalization & Inverse Text Normalization
Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies
This is a pandoc preprocessor toolkit based on my experiment pdtmpl
SPARD programming language implementation
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e. g. how to fine-tune Flair.
A minimalist single-header library for building pattern-matchers, lexers, and parsers.
Python library for creating PEG parsers
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
A platform enables sharing diverse knowledge, but similarly worded questions are common. We use NLP techniques to identify duplicate questions, enhancing user experience by making it easier to find high-quality answers.
Procedural macros for merging whitespace in const contexts
Vietnamese Input Method library
Python 📦 package for extracting quantity, units, and (sometimes) food names from unstructured recipe ingredients
A versatile CLI and Python wrapper for Google's Gemini Pro large language models. Streamline the creation of chatbots, generate dynamic text, analyze images and transcribe audio with ease.
A tool for identifying and censoring profanity in text
Simple command-line applications for generating passwords
Data processing utilities in keras3
A simple CLI filter to replace variables of the style `${KEY}` in text with their respective value.
Add a description, image, and links to the text-processing topic page so that developers can more easily learn about it.
To associate your repository with the text-processing topic, visit your repo's landing page and select "manage topics."