INDEX
Explanations
references to locations or places
Negative Logits
rape      -0.18
rez       -0.17
nte       -0.16
raid      -0.16
reste     -0.15
wayne     -0.15
ship      -0.15
aceous    -0.15
ium       -0.15
ists      -0.15
Positive Logits
bos       0.19
-temp     0.16
HOLDER    0.16
inx       0.15
picker    0.15
Bain      0.15
upon      0.15
ividual   0.14
-names    0.14
Combined  0.14
Activations Density: 0.067%