数据标注是一款专门对文本数据进行处理和标注的工具,通过简化快捷的文本标注流程和动态的算法反馈,支持用户快速标注关键词并能通过算法持续减少人工标注的成本和时间。数据标注的过程先由人工标注构建基础,再由自动标注反哺人工标注,最后由人工标注进行纠偏,从而大幅度提高标注的精准度和高效性。数据标注需要依赖开源的数字底座进行人员岗位管控。
-
Updated
Jan 20, 2025 - Java
数据标注是一款专门对文本数据进行处理和标注的工具,通过简化快捷的文本标注流程和动态的算法反馈,支持用户快速标注关键词并能通过算法持续减少人工标注的成本和时间。数据标注的过程先由人工标注构建基础,再由自动标注反哺人工标注,最后由人工标注进行纠偏,从而大幅度提高标注的精准度和高效性。数据标注需要依赖开源的数字底座进行人员岗位管控。
A toolkit that makes it easier to write recursive-descent parsers in Zig.
C language lexer & parser & virtual interpreter from scratch in Rust
Lua Compiler, (De)Obfuscator, Minifier, Beautifier, And more
🔋 In-place lightweight XML parser
🔧 My studies on context-free grammar, using ANTLR4 (C++) to generate the parser files. Some basics are developed, such as token processing, recursion, variable definition, array processing, Abstract Syntax Tree (AST) manipulation, UNICODE support, and error handling.
This is a short and modern JIT compiler that transform source text, into LLVM IR bytecode that executes machine code at runtime. This project was developed at the hths.hacks() hackathon against more 250+ participants internationally and was placed as a winner. Among the winners, my project was the only one developed solo.
🔧 My studies involving context-free grammar analysis. The analyzers were built using familiar tools such as YACC, Lex and Bison. Topics covered include token filtering, simple variable manipulation, and arrays.
A toolkit that helps you to write your own parser.
A JS/HTML/CSS Toolkit(Tokenizer、Parser) Support Template Syntax
A Basic Experiment in Parser and Compilers and Stack VM . A basic stack based CPU with Assembly language and basic commands. A basic programming Languge Parsed to Tokens to e parsed to expressions to be compiled to assembly code to be executed on the virtual CPU... Also to be used to Parse English grammar to make abstract syntax trees.
Machine Learning approach to Bengali Corpus POS Tagging using BNLTK. This is an experimenting project under the mentorship of Prof. Sandipan Ganguly, HIT-K.
A README for my private CS 2112 Critter World Project
A tiny and complete tool to supercharge static JSON strings with dynamic, user-defined expressions.
Oxide is a hybrid database and streaming messaging system (think Kafka + MySQL); supporting data access via REST and SQL.
Write use-case specific parsers within minutes!
bkengine脚本的解析器(开源实现)基于python3.8.4
An automatic UML generator for Java that *actually works*
Python Token Tokenizer for SQL using Postgresql Keywords
Add a description, image, and links to the tokenizer-parser topic page so that developers can more easily learn about it.
To associate your repository with the tokenizer-parser topic, visit your repo's landing page and select "manage topics."