INDEX
Explanations
the word 'opt' and its variations
instances of the word "opt" and its variations, indicating choices or preferences
New Auto-Interp
Negative Logits
INESS
-0.72
Danger
-0.71
flies
-0.71
nces
-0.68
borough
-0.65
Famous
-0.63
riage
-0.63
Granger
-0.61
bred
-0.59
Bam
-0.59
POSITIVE LOGITS
opt
1.01
atory
0.90
opting
0.87
uary
0.86
opted
0.86
imum
0.84
aye
0.84
atis
0.83
nir
0.83
ates
0.77
Activations Density 0.011%