INDEX
Explanations
phrases related to legal cases or conflicts
references to specific events or incidents
New Auto-Interp
Negative Logits
english
-0.68
Unix
-0.64
resid
-0.62
Currently
-0.61
ILCS
-0.59
registered
-0.58
AUT
-0.57
orf
-0.57
veland
-0.56
Currently
-0.56
POSITIVE LOGITS
debacle
1.10
fiasco
1.04
reminds
0.84
proves
0.83
inspires
0.80
horr
0.80
itself
0.79
underscores
0.78
ghazi
0.77
scare
0.77
Activations Density 0.534%