INDEX
Explanations
word "lazy" and related terms
references to laziness
Negative Logits
ciation     -0.76
idine       -0.74
rity        -0.73
Trust       -0.71
icipated    -0.70
iations     -0.70
ibel        -0.69
ellation    -0.68
ilateral    -0.67
semble      -0.66
Positive Logits
peasants    0.87
glers       0.86
hungry      0.76
bum         0.75
lazy        0.74
cooks       0.73
Hungry      0.73
azy         0.70
zed         0.70
uga         0.69
Activations Density 0.048%