INDEX
Explanations
phrases related to conditions and requirements
New Auto-Interp
Negative Logits
ãģĻãģĻ
-0.17
ayan
-0.16
Certain
-0.15
ichier
-0.15
avoids
-0.15
Avoid
-0.14
ãģ°ãģĭãĤĬ
-0.14
ugu
-0.14
vf
-0.14
certain
-0.14
POSITIVE LOGITS
usual
0.41
normal
0.37
usual
0.35
typical
0.32
conventional
0.32
normal
0.32
обÑĭÑĩ
0.30
traditional
0.30
ordinary
0.29
normally
0.29
Activations Density 0.007%