INDEX
Explanations
terms related to ferro-magnetism
ferro then magnetic
New Auto-Interp
Negative Logits
ÁL
-0.42
Aprile
-0.41
حياته
-0.40
pektor
-0.40
Deckel
-0.40
Omer
-0.40
September
-0.40
wond
-0.39
Sep
-0.38
reta
-0.38
POSITIVE LOGITS
ferro
2.28
ferro
2.03
Ferro
1.98
Ferro
1.92
hierro
0.90
ferrous
0.87
iron
0.80
ferrous
0.80
ftagPool
0.72
Iron
0.71
Activations Density 0.012%