INDEX
Explanations
relationships between individuals
references to friends and acquaintances in various contexts
New Auto-Interp
Negative Logits
ooks
-0.80
ories
-0.77
instruments
-0.73
Frames
-0.72
poons
-0.71
bows
-0.71
eeds
-0.70
ernels
-0.70
calendars
-0.69
ovies
-0.69
POSITIVE LOGITS
who
1.25
whom
1.16
whose
1.10
named
1.10
who
1.00
whose
0.91
friend
0.90
nicknamed
0.84
classmate
0.83
lier
0.81
Activations Density 0.262%