INDEX
Explanations
proper nouns related to names, possibly of people or places
names and references associated with specific individuals or characters
New Auto-Interp
Negative Logits
Chem
-0.70
AMA
-0.68
Participant
-0.67
Hannah
-0.66
atches
-0.66
gestation
-0.65
Food
-0.63
-0.63
Polo
-0.61
Bastard
-0.60
POSITIVE LOGITS
doms
0.91
riks
0.90
Reloaded
0.81
hyde
0.79
awa
0.77
nil
0.74
":"/
0.74
ting
0.74
nery
0.74
igation
0.73
Activations Density 0.030%