Explanations
proper nouns preceded by "Mr." (Mister)
titles or mentions of male characters
Negative Logits
Token                              Logit
ãĥ¼ãĥĨ                             -0.78
actionGroup                        -0.78
rawdownloadcloneembedreportprint   -0.69
anwhile                            -0.69
destro                             -0.69
SHIP                               -0.67
damp                               -0.64
Compared                           -0.62
bars                               -0.61
wills                              -0.61

Positive Logits
Token                              Logit
Universe                            0.88
Incredible                          0.76
imen                                0.72
angelo                              0.71
Claus                               0.71
rahim                               0.68
ude                                 0.67
gery                                0.65
uv                                  0.65
ufact                               0.65
Activations Density: 0.035%
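
The page does not define how the density figure is computed. As a minimal sketch, assume it is the fraction of sampled tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data below are hypothetical and chosen only to reproduce a value of 0.035% (7 active tokens out of 20,000).

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    activation. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 7 tokens fire out of 20,000 (not real data).
acts = [0.0] * 19993 + [1.2] * 7
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.035% means the feature fires on roughly 3.5 of every 10,000 tokens, which fits a feature tied to a narrow context such as names following "Mr.".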