INDEX
Explanations
phrases related to friends and family
references to friends and family
New Auto-Interp
Negative Logits
Fram
-0.81
RANT
-0.76
DEBUG
-0.74
ĸļ
-0.74
Reprodu
-0.73
GREEN
-0.72
cision
-0.69
Reconstruction
-0.69
arth
-0.68
monary
-0.68
POSITIVE LOGITS
acquaintances
1.79
relatives
1.50
classmates
1.50
coworkers
1.49
colleagues
1.43
neighbors
1.42
girlfriends
1.40
neighbours
1.36
strangers
1.33
comrades
1.31
Activations Density 0.089%