INDEX
Explanations
proper nouns related to various individuals
the word "who," indicating references to individuals or characters in various contexts
Negative Logits
Token            Logit
VIDEOS           -0.69
Checking         -0.66
Watching         -0.65
BOX              -0.65
PD               -0.64
Anyway           -0.64
Confederation    -0.62
Loading          -0.62
Okay             -0.61
AGES             -0.61
Positive Logits
Token            Logit
specializes      1.11
frequ            1.04
resided          1.02
oversaw          1.00
thri             0.99
fought           0.98
lived            0.98
wears            0.98
overcame         0.97
wore             0.95
Activations Density 0.118%
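
The density figure above is presumably the share of token positions on which this feature fires. The page does not define it, so treat that reading as an assumption. Under that assumption, a minimal sketch of the calculation looks like the following; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the fraction of token positions where
    the feature's activation is non-zero. The source page does not
    state its exact definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Toy data (not from the dashboard): 10,000 token positions, 12 active.
toy = [0.0] * 9988 + [0.5] * 12
print(f"{activation_density(toy):.3f}%")
```

A density well under 1%, as reported here, would mean the feature is sparse: it is silent on the overwhelming majority of tokens.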