INDEX
Explanations
phrases indicating estimates or approximations of quantity
New Auto-Interp
Negative Logits
uled
-0.17
istani
-0.16
agara
-0.15
lá
-0.15
/stretch
-0.15
etc
-0.13
lein
-0.13
tha
-0.13
.bb
-0.13
ses
-0.13
POSITIVE LOGITS
lier
0.17
adel
0.15
inis
0.15
LY
0.14
akk
0.14
;y
0.14
eo
0.14
EO
0.14
mesmo
0.14
885
0.14
Activations Density 0.084%