Explanations
phrases that indicate conditions or limitations on actions or rights
Negative Logits
ttemberg         -0.69
guenos           -0.63
ViewFeatures     -0.62
ModelExpression  -0.61
EPIC             -0.60
cellaneous       -0.56
¡¡               -0.56
ariConfig        -0.56
··               -0.55
)))))            -0.55
Positive Logits
лишь             1.01
merely           0.84
แค่              0.78
Only             0.75
only             0.75
脚注の使い方     0.74
Only             0.73
only             0.73
Apenas           0.73
pouze            0.72
Activations Density 0.251%