INDEX
Explanations
names or terms related to specific individuals or entities
references to specific geographic or political contexts
New Auto-Interp
Negative Logits
ampires
-0.73
slightest
-0.72
bugs
-0.69
whiff
-0.67
outnumbered
-0.65
adish
-0.64
prostitutes
-0.63
mosquitoes
-0.63
stains
-0.63
lowly
-0.63
POSITIVE LOGITS
igsaw
0.95
endeavour
0.90
repertoire
0.89
continuum
0.83
effort
0.83
regimen
0.82
yssey
0.78
curriculum
0.78
strategy
0.77
issance
0.76
Activations Density 0.350%