INDEX
Explanations
names of people or organizations
proper nouns, specifically names of individuals and locations
New Auto-Interp
Negative Logits
womb
-0.76
chem
-0.71
tune
-0.69
Hurricane
-0.62
USC
-0.57
Mandatory
-0.57
millennials
-0.57
trapping
-0.57
reins
-0.56
semester
-0.56
POSITIVE LOGITS
itsch
0.96
arde
0.96
cott
0.92
arb
0.91
zinski
0.90
igl
0.90
assi
0.88
apologised
0.88
ovich
0.86
chin
0.86
Activations Density 0.143%