INDEX
Explanations
the word "does" followed by another word, implying actions being performed or not performed
the phrase "does not" indicating negation or contrast in statements
New Auto-Interp
Negative Logits
boarding
-0.80
estern
-0.65
Seasons
-0.63
ansas
-0.62
isu
-0.62
palms
-0.61
Learns
-0.61
este
-0.59
Ready
-0.58
Printed
-0.58
POSITIVE LOGITS
vet
1.17
NOT
1.05
oms
0.97
not
0.97
nothing
0.94
exist
0.92
oming
0.89
indeed
0.87
seem
0.87
occur
0.87
Activations Density 0.065%