INDEX
Explanations
references to iron and its various biological and health-related contexts
Negative Logits
myſelf          -1.00
themſelves      -1.00
RenderAtEndOf   -0.99
ſeveral         -0.97
Popp            -0.91
himſelf         -0.90
pleaſure        -0.90
Monfieur        -0.90
whoſe           -0.90
kasarigan       -0.90
Positive Logits
iron            3.32
Iron            3.17
Iron            3.02
iron            2.82
IRON            2.74
IRON            2.48
irons           2.13
Irons           1.82
铁              1.81
hierro          1.79
Activations Density: 0.106%