INDEX
Explanations
references to the politician Al Gore
mentions of Al Gore
New Auto-Interp
Negative Logits
akedown
-0.76
olog
-0.69
ivity
-0.68
Constructed
-0.67
uously
-0.64
riage
-0.63
ablished
-0.61
uating
-0.61
uing
-0.61
imen
-0.59
POSITIVE LOGITS
byss
1.20
Gore
1.09
pedia
0.86
vine
0.78
bite
0.76
achev
0.76
hound
0.75
boro
0.74
bage
0.74
contrace
0.72
Activations Density 0.003%