INDEX
Explanations
adjectives or nouns denoting deep significance or purpose
concepts related to the idea of "meaning."
New Auto-Interp
Negative Logits
aliation
-0.69
Citiz
-0.67
osterone
-0.67
iets
-0.67
asting
-0.66
avorite
-0.65
Frazier
-0.65
asted
-0.63
DERR
-0.61
emetery
-0.59
POSITIVE LOGITS
fully
1.65
ful
1.38
lessness
1.32
fulness
1.25
lessly
1.02
ual
0.98
sworth
0.96
istically
0.96
FUL
0.95
full
0.92
Activations Density 0.034%