INDEX
Explanations
phrases related to family relationships
expressions indicating familial relationships
New Auto-Interp
Negative Logits
nels
-0.83
ngth
-0.77
aceous
-0.76
pport
-0.74
insk
-0.73
AE
-0.73
need
-0.70
force
-0.70
Downloadha
-0.69
tesy
-0.68
POSITIVE LOGITS
immigrants
0.81
billionaire
0.78
slain
0.76
wealthy
0.75
Holocaust
0.74
Cul
0.74
immigrant
0.73
Emmy
0.71
Zeus
0.71
Martha
0.71
Activations Density 0.076%