INDEX
Explanations
phrases indicating personal reflections or experiences
New Auto-Interp
Negative Logits
GEBURTSDATUM
-0.76
TagMode
-0.72
Numerade
-0.69
awtextra
-0.68
лтемелер
-0.66
⟬
-0.65
thâu
-0.63
MessageTagHelper
-0.63
новниш
-0.63
Havolalar
-0.62
POSITIVE LOGITS
reflective
0.36
readers
0.33
шпа
0.32
ネタ
0.32
topic
0.31
posting
0.31
detailed
0.31
feature
0.30
thread
0.30
bec
0.30
Activations Density 1.502%