INDEX
Negative Logits
dov
-0.11
iku
-0.10
neau
-0.10
Credit
-0.10
amber
-0.10
inde
-0.09
gang
-0.09
autom
-0.09
alie
-0.09
OSS
-0.09
POSITIVE LOGITS
provide
0.10
HIR
0.10
hope
0.10
cung
0.10
answer
0.10
providing
0.10
æıIJä¾Ľ
0.10
å¸ĮæľĽ
0.09
hope
0.09
proporcion
0.09
Activations Density 0.076%