Guild icon
Teeworlds
IRC / bridge
One-way IRC channel bridge. If you want to be able to send messages to IRC, contact @Dune or @heinrich5991. https://www.teeworlds.com/?page=docs&wiki=rules/irc_rules
Between 2020-02-28 00:00:00Z and 2020-02-29 00:00:00Z
Avatar
[quakenet] Learath2 BOT 2020-02-28 12:46:09Z
is there a tokenizer somewhere in the tw source?
Avatar
there are tokens being generated.
12:48
oh. now I see what a tokenizer is ._. I'm dumb
Avatar
I don't recall seeing one
Avatar
system.c needs one :'(
Avatar
theoretically the splitting of strings by whitespaces is a form of tokenization
21:39
so in python "I am assa".split(" ") -> the split funktion is a tokenizer, but a bad one
Avatar
does the c++ standard library provide such simple functionality?
22:28
😮
22:29
ina simple way
Avatar
boost does
Avatar
oof, external dependencies
22:45
basically defining 20 template parameters to split a string 😄
22:45
boost classic
Avatar
just use streams I guess
22:46
ss std::stringstream classic 😮
Avatar
[quakenet] rand BOT 2020-02-28 22:55:57Z
it's time for strtok
Avatar
std::regex
Avatar
yeah regex for tokenization 😮
Avatar
well, if one does not care about performance at all
23:43
use python 😄
23:43
with regex
Avatar
no in python I use the tokenizer of sklearn or spacy
Avatar
to split string
23:44
bette rthan a whole state machine ._.
23:45
finite state machine
Avatar
there are dynamic and static tokenizers
23:45
i can give you literature if you want
Exported 30 message(s)