INDEX
Explanations
words related to brains
the concept of "gains" or positive outcomes in various contexts
Negative Logits
Kinnikuman  -0.77
enhagen     -0.72
Ens         -0.70
mercial     -0.69
ophon       -0.69
govtrack    -0.69
olog        -0.68
skelet      -0.67
album       -0.67
nec         -0.67
Positive Logits
hene        0.76
ains        0.73
IFT         0.71
pring       0.70
cott        0.68
fo          0.68
wic         0.68
ees         0.67
igue        0.66
ugal        0.65
Activations Density: 0.018%