INDEX
Explanations
linguistic markers and structure in the text
New Auto-Interp
Negative Logits
aye
-0.17
aki
-0.16
prav
-0.16
jerne
-0.14
exampleInput
-0.14
Class
-0.13
anomal
-0.13
brid
-0.13
gameTime
-0.13
orge
-0.13
POSITIVE LOGITS
Bereich
0.21
Beitrag
0.19
Gang
0.18
punkt
0.17
ismus
0.16
ivism
0.16
Blick
0.16
Ort
0.16
Countdown
0.15
Einsatz
0.15
Activations Density 0.037%