INDEX
Explanations
expressions of magnitude or intensity
Negative Logits
Lähteet            -0.89
<_>                -0.74
migrationBuilder   -0.63
almeno             -0.63
gynhyrchwyd        -0.60
permanentes        -0.59
曖昧さ回避          -0.57
seamnă             -0.56
ledig              -0.55
já                 -0.54
Positive Logits
מאוד               0.67
sekali             0.64
great              0.64
important          0.60
greatly            0.59
valuable           0.59
heavy              0.58
heavily            0.58
hugely             0.57
great              0.56
Activations Density 0.242%