INDEX
Explanations
information regarding marriages
instances of the word "married" in various contexts
New Auto-Interp
Negative Logits
Flavoring
-0.98
nesota
-0.77
olin
-0.77
urg
-0.75
vernment
-0.73
abwe
-0.73
ostic
-0.72
atics
-0.72
affer
-0.70
opia
-0.69
POSITIVE LOGITS
nesday
0.99
married
0.82
ton
0.77
marry
0.77
divorced
0.75
couples
0.73
married
0.70
equality
0.70
marrying
0.70
divorce
0.70
Activations Density 0.038%