INDEX
Explanations
mentions of the number two and other numbers reflecting quantity
New Auto-Interp
Negative Logits
çĻ»
-0.16
говоÑĢ
-0.15
bred
-0.14
avou
-0.14
eree
-0.14
usi
-0.14
ARSER
-0.13
minimum
-0.13
uz
-0.13
esen
-0.13
POSITIVE LOGITS
acco
0.15
ษ
0.14
onder
0.14
allet
0.14
apt
0.13
akeup
0.13
rog
0.13
aniel
0.13
anten
0.13
Raphael
0.13
Activations Density 0.047%