INDEX
Explanations
the word "do" in sentences
repetitive phrases indicating agreement or affirmation
Negative Logits
Madagascar    -0.69
case          -0.65
ufact         -0.64
Entered       -0.62
hog           -0.62
SAR           -0.62
Printed       -0.62
aro           -0.61
Handling      -0.60
Hok           -0.59
Positive Logits
vet           0.92
pez           0.89
apologise     0.86
understand    0.85
ppel          0.84
intend        0.83
berman        0.83
acknowledge   0.82
apologize     0.82
hereby        0.81
Activations Density: 0.092%