INDEX
Explanations
mentions of specific numbers and quantifiable amounts
Negative Logits
[â̦]
-0.75
whilst
-0.73
().
-0.67
civilisation
-0.63
independ
-0.63
fucking
-0.63
('
-0.62
Âł
-0.61
ali
-0.61
Contents
-0.60
Positive Logits
meanwhile
0.84
meantime
0.75
swers
0.75
ibliography
0.70
Asked
0.69
DeVos
0.67
nonprofits
0.66
spokeswoman
0.66
echoed
0.64
aback
0.63
Activations Density 27.182%