INDEX
Explanations
mentions of tech companies and electronic products
followed by punctuation
negative or ending states
New Auto-Interp
Negative Logits
aarrggbb
-0.70
متعلقه
-0.68
ftagPool
-0.68
الإنجليزية
-0.64
parsedMessage
-0.63
########.
-0.63
MessageTagHelper
-0.60
makeText
-0.56
*{\-0.55
المناصب
-0.55
POSITIVE LOGITS
anymore
1.18
unless
1.08
nor
1.07
unless
0.94
any
0.90
akaan
0.88
tampoco
0.83
nici
0.81
enää
0.79
apapun
0.78
Activations Density 0.688%