INDEX
Explanations
proper nouns and terms related to characters, especially in creative works like films, games, and books
references to characters in stories or narratives
Negative Logits
Token        Logit
Accessory    -0.71
Effective    -0.64
LOCK         -0.64
ribution     -0.64
ctr          -0.62
Kremlin      -0.62
IED          -0.60
unilateral   -0.60
Tant         -0.60
Wass         -0.59
Positive Logits
Token        Logit
acters       1.44
istically    0.90
hips         0.89
istics       0.82
hip          0.81
poons        0.78
Characters   0.77
characters   0.76
arcs         0.76
inhab        0.74
Activations Density: 0.021%