INDEX
Explanations
quotation marks
tokens that occur inside or adjacent to direct speech/quotation (dialogue).
New Auto-Interp
Negative Logits
coloured
-0.07
Tricks
-0.06
managed
-0.06
Watching
-0.06
Unternehmen
-0.06
�
-0.06
Ông
-0.06
ávka
-0.06
�
-0.06
newspaper
-0.06
POSITIVE LOGITS
ATT
0.06
bons
0.06
prostřed
0.06
dT
0.06
зн
0.06
ึ้
0.06
опыт
0.06
surg
0.06
_THREADS
0.06
objc
0.05
Activations Density 0.107%