INDEX
Explanations
phrases related to an ideal state or concept
expressions of the concept of "ideal."
New Auto-Interp
Negative Logits
bane
-0.82
ktop
-0.77
rough
-0.73
been
-0.71
weak
-0.69
hani
-0.69
upon
-0.68
worthiness
-0.68
SEE
-0.67
SPONSORED
-0.67
POSITIVE LOGITS
istically
1.05
imates
1.03
istic
1.02
yip
0.86
embodiment
0.85
imum
0.82
ideal
0.80
igslist
0.78
representation
0.77
ized
0.74
Activations Density 0.010%