INDEX
Explanations
words related to physical temperament or temperature
references to "temper" and related terms
New Auto-Interp
Negative Logits
roma
-0.87
rd
-0.72
location
-0.69
km
-0.69
gom
-0.68
rawl
-0.67
HEAD
-0.66
nz
-0.66
ÅĤ
-0.65
NX
-0.65
POSITIVE LOGITS
tant
1.29
temper
1.25
amental
1.22
aments
1.05
Temper
1.02
tempered
0.95
climates
0.87
pit
0.86
defe
0.82
temperament
0.81
Activations Density 0.011%