INDEX
Explanations
references to influential figures or entities
the term "leading", particularly in various contexts and its associated phrases
New Auto-Interp
Negative Logits
bley
-0.71
atron
-0.69
apo
-0.67
ships
-0.66
bryce
-0.66
ilation
-0.66
ossession
-0.65
ALLY
-0.64
cia
-0.64
thur
-0.64
POSITIVE LOGITS
scorer
1.09
edge
1.07
contenders
1.02
indicators
1.01
contender
1.01
indicator
1.01
proponent
0.98
rusher
0.90
stone
0.86
exponent
0.84
Activations Density 0.032%