INDEX
Explanations
terms related to some kind of primary aspect or focus
the term "primary" in various contexts related to significance or ranking
New Auto-Interp
Negative Logits
uge
-0.73
estern
-0.71
INESS
-0.70
glass
-0.68
EEK
-0.68
Doodle
-0.67
Feet
-0.66
phis
-0.66
sung
-0.66
LV
-0.64
POSITIVE LOGITS
careg
0.91
tenance
0.88
beneficiary
0.85
antagonist
0.84
responsibility
0.78
culprit
0.78
iary
0.76
ities
0.74
distingu
0.74
pivot
0.74
Activations Density 0.018%