INDEX
Explanations
temporal expressions and context indicators
introducing context or modifiers
New Auto-Interp
Negative Logits
todella
-0.41
geval
-0.40
GTCX
-0.38
muhte
-0.36
gehouden
-0.35
ůr
-0.35
bepaalde
-0.35
gerçekten
-0.35
siis
-0.34
yani
-0.34
POSITIVE LOGITS
AndEndTag
0.77
tonode
0.63
rylic
0.53
########.
0.52
كومونز
0.51
насељу
0.49
verſ
0.48
delwed
0.48
ſcher
0.47
hobbies
0.46
Activations Density 0.038%