INDEX
Explanations
nouns representing physical objects or concepts
phrases containing the word "piece" used in various contexts
New Auto-Interp
Negative Logits
gyn
-0.72
essors
-0.71
ubs
-0.71
uations
-0.69
raints
-0.66
uers
-0.65
riers
-0.64
oons
-0.63
riages
-0.63
esses
-0.63
POSITIVE LOGITS
legislation
0.84
scenery
0.83
cloth
0.82
gum
0.77
machinery
0.77
trivia
0.76
luck
0.75
equipment
0.75
cake
0.75
wreckage
0.74
Activations Density 0.069%