INDEX
Explanations
mentions of the word "fox."
references to foxes and terms related to minimization and maximization
New Auto-Interp
Negative Logits
Rated
-0.82
Interstitial
-0.73
Effective
-0.70
INK
-0.69
ISIL
-0.67
CI
-0.66
immersion
-0.65
Mour
-0.64
Franch
-0.64
externalActionCode
-0.64
POSITIVE LOGITS
ours
0.94
bour
0.83
clud
0.81
es
0.80
holes
0.76
pex
0.76
yip
0.75
erb
0.75
ples
0.74
elled
0.74
Activations Density 0.033%