INDEX
Explanations
proper nouns and terms related to different individuals and entities
mentions of characters or elements related to a specific narrative or storyline
New Auto-Interp
Negative Logits
SAY
-0.75
TERN
-0.74
acher
-0.73
peanuts
-0.70
ependence
-0.65
IPM
-0.65
istant
-0.65
almonds
-0.64
ãĥĨãĤ£
-0.63
cul
-0.63
POSITIVE LOGITS
lain
0.93
boat
0.93
anqu
0.74
lets
0.74
ilts
0.73
boats
0.72
loo
0.72
CK
0.72
glers
0.71
Jenner
0.70
Activations Density 0.038%