INDEX
Explanations
references to the quantity and involvement of individuals or groups in various contexts
Negative Logits
mostly        -0.19
iversit       -0.17
always        -0.16
ensa          -0.16
tất           -0.15
siempre       -0.15
mostly        -0.15
Mostly        -0.15
vždy          -0.14
always        -0.14
Positive Logits
simply        0.20
Simply        0.18
Simply        0.17
348           0.16
-times        0.15
gree          0.15
arda          0.14
simplement    0.14
ogg           0.14
already       0.14
Activations Density 0.138%