INDEX
Explanations
mentions of legal actions or consequences
instances of the word "the."
New Auto-Interp
Negative Logits
ça
-0.70
nance
-0.69
racuse
-0.68
ional
-0.67
âĿ
-0.66
load
-0.65
pointers
-0.63
lander
-0.62
ometers
-0.62
zbollah
-0.62
POSITIVE LOGITS
guise
1.48
midst
1.34
meantime
1.21
hopes
1.20
wake
1.17
hope
1.16
pursuit
1.15
vain
1.13
manner
1.11
absence
1.10
Activations Density 0.165%