INDEX
Explanations
not have the ability to burn
New Auto-Interp
Negative Logits
Fir
-0.10
Wagner
-0.10
Amar
-0.09
arsen
-0.09
Fee
-0.09
ca
-0.09
Fon
-0.09
ÑĥлÑİ
-0.09
Extr
-0.09
ute
-0.08
POSITIVE LOGITS
fuel
0.19
burn
0.18
burn
0.18
çĩĥ
0.18
burning
0.18
comb
0.18
cháy
0.17
sto
0.17
fuel
0.17
Comb
0.16
Activations Density 0.039%