INDEX
Explanations
the name "Gore" with varying magnitudes of relevance
mentions of the name "Gore."
New Auto-Interp
Negative Logits
akedown
-0.89
Constructed
-0.81
ivity
-0.77
istically
-0.73
ologically
-0.73
ively
-0.72
uously
-0.72
uing
-0.67
alogue
-0.67
onomic
-0.67
POSITIVE LOGITS
byss
1.34
stein
0.94
pedia
0.89
hound
0.85
tto
0.84
tti
0.84
cki
0.79
Verb
0.78
boro
0.78
bite
0.77
Activations Density 0.011%