tokenizer
A grammar describes the syntax of a programming language and might be defined in Backus-Naur form (BNF). A lexer performs lexical analysis, turning text into tokens. A parser takes tokens and builds a data structure such as an abstract syntax tree (AST). The parser is concerned with context: does the sequence of tokens fit the grammar? A compiler combines a lexer and parser with further stages, such as semantic analysis and code generation, all built for a specific grammar.
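The lexer/parser split described above can be sketched in a few lines. This is a minimal illustration, not code from any listed project; the token names and regular expressions are invented for a toy arithmetic language:

```python
import re

# Token specification for a tiny arithmetic language (illustrative only).
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("PLUS",   r"\+"),
    ("TIMES",  r"\*"),
    ("LPAREN", r"\("),
    ("RPAREN", r"\)"),
    ("SKIP",   r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC))

def tokenize(text):
    """Turn source text into (kind, value) tokens; lexical analysis only."""
    for m in MASTER.finditer(text):
        if m.lastgroup != "SKIP":  # whitespace is dropped, not emitted
            yield (m.lastgroup, m.group())

print(list(tokenize("1 + 2 * 3")))
# [('NUMBER', '1'), ('PLUS', '+'), ('NUMBER', '2'), ('TIMES', '*'), ('NUMBER', '3')]
```

The lexer knows nothing about precedence or nesting; deciding whether the token sequence fits the grammar is the parser's job.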
Here are 1,075 public repositories matching this topic...
- Natural Language Processing in your browser (updated Feb 11, 2018)
- Tokenization and stemming of all the words in a collection of documents (Java, updated Feb 16, 2017)
- Vietnamese tokenizer using Maximum Matching and CRF (Python, updated Mar 1, 2017)
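Maximum matching, named in the Vietnamese tokenizer entry above, is the classic greedy dictionary-based segmentation: from the left edge, always take the longest dictionary word that matches. A toy sketch with an invented English dictionary (the real project's dictionary and CRF stage are not shown):

```python
def max_match(text, dictionary, max_len=12):
    """Greedily split text into the longest dictionary words from the left.

    Falls back to a single character when no dictionary word matches.
    """
    words = []
    i = 0
    while i < len(text):
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in dictionary or j == i + 1:
                words.append(text[i:j])
                i = j
                break
    return words

vocab = {"thank", "thanks", "giving", "thanksgiving"}
print(max_match("thanksgiving", vocab))  # ['thanksgiving'], not ['thanks', 'giving']
```

Greedy longest-match is fast but can segment wrongly, which is why such tokenizers often add a statistical model like a CRF on top.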
- Sentiment analysis of tweets using Word2Vec and exploratory data analysis in Python (Jupyter Notebook, updated Sep 1, 2023)
- Custom resume screening / skill extractor: a custom-labelled, trained, and saved NER model (Python, updated Feb 9, 2021)
- An interpreter for a small imperative language (Java, updated Aug 20, 2021)
- Coronavirus tweets NLP text classification; mini-project for a Data Science course, FCSE, Skopje (Jupyter Notebook, updated May 14, 2022)
- A Python-based tokenizer for Roman-Urdu text that handles compound words (Jupyter Notebook, updated Apr 16, 2023)
- A simple brainf**k interpreter written in Rust (Rust, updated Mar 16, 2023)
- Trent + Chippi = TRIPPI Programming Language (project for CS451) (Go, updated Mar 28, 2017)
- A simple implementation of the GPT-4 tokenizer (Jupyter Notebook, updated Mar 11, 2024)
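The GPT-4 tokenizer mentioned above is a byte-level byte-pair-encoding (BPE) tokenizer. The core of BPE training is: count adjacent token pairs, then merge the most frequent pair into a new token ID. A toy sketch of that one step (the input string and new token ID are invented; real tokenizers repeat this for thousands of merges):

```python
from collections import Counter

def most_frequent_pair(ids):
    """Count adjacent ID pairs and return the most common one (BPE's core step)."""
    return Counter(zip(ids, ids[1:])).most_common(1)[0][0]

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list(b"aaabdaaabac")       # byte-level BPE starts from raw bytes
pair = most_frequent_pair(ids)   # (97, 97): the byte pair 'aa' occurs most often
ids = merge(ids, pair, 256)      # mint a fresh token ID, 256, for that pair
```

Repeating merge rounds like this is how a BPE vocabulary grows from 256 byte tokens to tens of thousands of subword tokens.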
- 📄 Recursive descent parser, abstract syntax trees, tokenizer (JavaScript, updated Dec 17, 2023)
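Recursive descent, named in the entry above, means writing one function per grammar rule and having each function consume the tokens it expects. A minimal sketch for the invented grammar `expr := NUMBER ('+' NUMBER)*`, producing a nested-tuple AST (the token format and node shapes are illustrative, not from the listed repository):

```python
def parse(tokens):
    """Recursive descent parser for expr := NUMBER ('+' NUMBER)*."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def expect(kind):
        # Consume one token of the given kind or fail with a syntax error.
        nonlocal pos
        tok = peek()
        if tok is None or tok[0] != kind:
            raise SyntaxError(f"expected {kind}, got {tok!r}")
        pos += 1
        return tok

    def expr():
        # One function per grammar rule; '+' associates to the left.
        node = ("num", expect("NUMBER")[1])
        while peek() is not None and peek()[0] == "PLUS":
            expect("PLUS")
            node = ("add", node, ("num", expect("NUMBER")[1]))
        return node

    tree = expr()
    if peek() is not None:
        raise SyntaxError("trailing tokens")
    return tree

print(parse([("NUMBER", "1"), ("PLUS", "+"), ("NUMBER", "2")]))
# ('add', ('num', '1'), ('num', '2'))
```

Each grammar rule maps to one function, so extending the grammar (say, with `*` at higher precedence) means adding one more function that `expr` calls.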
- Natural language tokenizer for English and Japanese documents (Python, updated Jul 16, 2017)