Explanations
names and titles related to individuals or characters
Negative Logits
kefeller    -0.70
ruary       -0.69
tire        -0.68
achine      -0.64
essor       -0.62
ORTS        -0.61
tyre        -0.60
orphans     -0.58
Reviewed    -0.57
cigars      -0.56
Positive Logits
idis                0.70
Wonderland          0.70
vez                 0.70
Pacific             0.68
umeric              0.67
è¦ļéĨĴ              0.67
iop                 0.66
iris                0.65
=-=-=-=-=-=-=-=-    0.63
¥µ                  0.63
Activations Density: 0.180%