INDEX
Explanations
phrases related to furniture
references to pieces of furniture
New Auto-Interp
Negative Logits
aneously
-0.67
Ibid
-0.64
irgin
-0.63
ounty
-0.63
Violent
-0.63
————
-0.62
awar
-0.62
mary
-0.61
ml
-0.61
olics
-0.61
POSITIVE LOGITS
furniture
1.33
iture
1.01
decoration
0.99
decorations
0.94
chairs
0.94
drawer
0.89
Furn
0.81
Crate
0.80
ronics
0.77
pillar
0.76
Activations Density 0.019%