INDEX
Explanations
occurrences of the word "twenty" and its variations
New Auto-Interp
Negative Logits
Activity
-0.78
Bank
-0.66
ULE
-0.64
Redemption
-0.60
ghai
-0.59
Caption
-0.58
etter
-0.57
Pwr
-0.56
GD
-0.55
GS
-0.53
POSITIVE LOGITS
thousand
1.16
eenth
1.11
Eight
0.92
fold
0.92
Thousand
0.91
eight
0.90
ousand
0.88
four
0.88
something
0.87
teenth
0.85
Activations Density 0.044%