INDEX
Explanations
the word "do" followed by actions or beliefs
instances of the phrase "I do."
Negative Logits
Handling     -0.75
fields       -0.69
Entered      -0.63
hog          -0.63
Madagascar   -0.62
Rah          -0.62
case         -0.61
ware         -0.60
Ages         -0.60
Pyongyang    -0.59
Positive Logits
pez          0.99
omsday       0.92
ppel         0.91
herty        0.84
女           0.84
lez          0.83
oley         0.82
vet          0.82
ggy          0.78
onga         0.77
Activations Density: 0.114%