INDEX
Explanations
mentions of the number "four" in various contexts
occurrences of the number "four."
Negative Logits
idence        -0.74
Fed           -0.73
UGE           -0.70
Rica          -0.67
isky          -0.66
igation       -0.64
Happ          -0.64
Collider      -0.63
ller          -0.62
ustration     -0.60
Positive Logits
teenth        1.77
teen          1.75
een           1.17
eenth         1.10
hundred       1.02
aciously      1.00
some          0.99
fif           0.98
months        0.93
thirds        0.92
Activations Density: 0.031%