INDEX
Explanations
mentions of family relationships and life events
references to familial relationships and family dynamics
New Auto-Interp
Negative Logits
takedown
-0.78
forcement
-0.73
ItemTracker
-0.72
LEVEL
-0.70
flashlight
-0.69
redd
-0.68
XP
-0.66
rack
-0.66
imaru
-0.65
sabotage
-0.65
POSITIVE LOGITS
divorced
1.51
divor
1.39
daughter
1.34
daughters
1.33
grandchildren
1.32
divorce
1.29
married
1.28
widow
1.24
wife
1.22
granddaughter
1.18
Activations Density 0.424%