INDEX
Explanations
instances of the word "married"
references to the concept of being married
New Auto-Interp
Negative Logits
anwhile
-0.84
umbn
-0.75
Flavoring
-0.72
urg
-0.72
illin
-0.72
ombies
-0.66
acco
-0.66
emonic
-0.65
Oo
-0.65
EMS
-0.63
POSITIVE LOGITS
nesday
0.93
couples
0.93
married
0.93
marry
0.83
equality
0.78
equality
0.77
bachelor
0.76
divor
0.74
hood
0.72
tons
0.72
Activations Density 0.015%