INDEX
Explanations
New Auto-Interp
Negative Logits
beaver
-0.56
wives
-0.53
بيها
-0.51
stork
-0.51
opra
-0.51
waukee
-0.50
betweenstory
-0.50
ruptedException
-0.50
:✨
-0.49
ⓧ
-0.49
POSITIVE LOGITS
<bos>
0.66
عنها
0.59
AutoScaleMode
0.56
InputLabel
0.56
côtes
0.54
dataclass
0.53
GetResponse
0.53
freiheit
0.52
Cyfeiriadau
0.52
ZoneId
0.52
Activations Density 8.955%