INDEX
Explanations
phrases indicating a contrast or clarification in language
the phrase "not that" and its variations emphasizing distinctions or clarifications
New Auto-Interp
Negative Logits
aukee
-0.69
lain
-0.68
oplan
-0.67
apse
-0.67
intosh
-0.64
pta
-0.62
mun
-0.61
elin
-0.61
antha
-0.60
MODE
-0.59
POSITIVE LOGITS
expecting
0.71
kidding
0.65
anybody
0.64
anyone
0.63
coincidence
0.62
vernment
0.62
fancy
0.61
mention
0.60
anymore
0.60
Loan
0.60
Activations Density 0.096%