INDEX
Explanations
adverbs indicating ease or simplicity of an action
New Auto-Interp
Negative Logits
-
-0.63
Ker
-0.57
ra
-0.51
chữ
-0.50
ZY
-0.50
R
-0.49
itch
-0.48
Zag
-0.48
dare
-0.48
difficult
-0.48
POSITIVE LOGITS
مرئيه
0.94
énario
0.93
समीक्षक
0.85
saites
0.85
archiviato
0.84
'],
0.84
CONSIN
0.84
saraba
0.82
ویکیآمباردا
0.82
:✨
0.81
Activations Density 0.011%