Explanations
words associated with lack or absence
Negative Logits
Token        Logit
ICAN         -0.74
jun          -0.72
Pwr          -0.69
concess      -0.66
Demand       -0.64
éŃĶ          -0.59
hinge        -0.58
ĪĴ           -0.58
OH           -0.57
depressive   -0.56
Positive Logits
Token        Logit
enhagen      0.86
nir          0.76
emort        0.73
ieu          0.69
heit         0.69
ère          0.68
adelphia     0.68
enthal       0.66
tissues      0.65
rette        0.65
Activations Density: 0.018%