INDEX
Explanations
regulation of body temperature
Negative Logits
hotter    0.60
холод     0.58
hot       0.55
chaude    0.53
cold      0.52
quente    0.51
hot       0.50
chaud     0.49
warme     0.49
cold      0.49
Positive Logits
LOSS            0.47
loss            0.46
hyper           0.45
્ન              0.43
impaired        0.42
impairs         0.42
Loss            0.42
hypoglycemia    0.42
Loss            0.41
paradoxical     0.40
Activations Density: 0.026%