INDEX
Explanations
references to couples
instances of the word "couple"
Negative Logits
schild        -0.73
umbn          -0.67
ibaba         -0.67
DERR          -0.65
é¾            -0.64
éļ            -0.64
Directorate   -0.62
Flavoring     -0.62
resso         -0.59
Glob          -0.58
Positive Logits
divorced      0.99
married       0.98
wed           0.95
divor         0.91
hood          0.87
riages        0.85
couples       0.84
maid          0.83
ndra          0.82
ter           0.78
Activations Density 0.027%