Explanations
statements involving reporting or informing someone about an event or situation
instances of the word "that" in various contexts
Negative Logits
quer          -0.76
hack          -0.65
ument         -0.61
bay           -0.57
Hispanic      -0.56
athon         -0.56
Bone          -0.55
gain          -0.55
Ò             -0.55
query         -0.53
Positive Logits
Filename      0.73
they          0.67
atta          0.66
andon         0.62
THEY          0.62
dangers       0.60
beware        0.59
mbuds         0.59
ommel         0.59
advantageous  0.57
Activations Density 0.228%
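
The density figure can be read as the share of sampled token positions on which the feature fires. The source does not say how the number was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds threshold.

    Assumption: "density" means the fraction of token positions where the
    feature is active. The dashboard's exact definition is not given.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature fires above the threshold.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: the feature fires on 3 of 1000 token positions.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.228% would correspond to the feature being active on roughly 2 of every 1000 sampled positions.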