INDEX
Explanations
proper nouns, likely names of characters in a story
New Auto-Interp
Negative Logits
natureconservancy
-0.73
Else
-0.72
uberty
-0.66
stem
-0.63
wcsstore
-0.61
IRO
-0.60
Kin
-0.60
BIT
-0.59
wrench
-0.59
LV
-0.59
POSITIVE LOGITS
ess
1.18
enance
1.17
esses
1.07
ies
1.03
s
1.03
ries
1.01
ing
0.92
ships
0.86
sburg
0.84
hips
0.84
Activations Density 5.627%