INDEX
Explanations
occurrences of the word "even"
New Auto-Interp
Negative Logits
Rosalie
-0.80
AIS
-0.77
PDA
-0.76
Kanu
-0.76
configureStore
-0.75
Danville
-0.74
Racine
-0.74
KOL
-0.74
HRC
-0.74
BPS
-0.73
POSITIVE LOGITS
even
1.53
even
1.29
Even
1.28
EVEN
1.27
Even
1.20
EVEN
1.12
Даже
0.95
Incluso
0.91
Même
0.90
Mesmo
0.90
Activations Density 0.081%