INDEX
Explanations
phrases related to authorship or attribution in documents
instances of the word "for" in various contexts
Negative Logits
soever        -0.75
[+            -0.67
quo           -0.65
alive         -0.64
hazard        -0.64
ynski         -0.63
nevertheless  -0.63
ÃŁ            -0.62
egu           -0.61
ptions        -0.61
Positive Logits
geries        1.23
bidden        0.91
gery          0.87
inclusion     0.85
imeo          0.81
Collider      0.73
listeners     0.73
sale          0.70
viewers       0.69
ummies        0.69
Activations Density 0.208%