INDEX
Explanations
words related to family and relationships
references to friends and relatives
New Auto-Interp
Negative Logits
Cola
-0.77
lite
-0.77
yss
-0.73
Effective
-0.71
idth
-0.68
Indust
-0.67
Loch
-0.65
Vert
-0.65
aughty
-0.64
OY
-0.62
POSITIVE LOGITS
hips
1.07
relatives
0.99
whom
0.93
deceased
0.90
abroad
0.89
folk
0.88
who
0.83
caregivers
0.83
grieving
0.82
confid
0.78
Activations Density 0.094%