INDEX
Negative Logits
Dump
-0.08
결국
-0.08
622
-0.08
Flg
-0.08
hemi
-0.08
áreas
-0.08
procur
-0.08
-area
-0.08
Laval
-0.08
notoriously
-0.08
POSITIVE LOGITS
explanations
0.12
explanation
0.11
Explanation
0.11
Explain
0.11
disclaim
0.10
Explanation
0.10
объяс
0.10
説明
0.10
elabor
0.09
narr
0.09
Activations Density 0.099%