INDEX
Explanations
an ongoing gradual change or transition
New Auto-Interp
Negative Logits
deal
-0.17
ute
-0.16
abus
-0.16
дел
-0.16
ÏīÏĥη
-0.15
vault
-0.15
alous
-0.15
862
-0.15
ouch
-0.14
anga
-0.14
POSITIVE LOGITS
ifiers
0.17
bjerg
0.16
Hind
0.16
enburg
0.16
appid
0.15
ceipt
0.15
ãĤ¤ãĥ³ãĥĪ
0.14
ERA
0.14
inski
0.14
ifier
0.14
Activations Density 0.027%