INDEX
Explanations
instances of phrases or clauses expressing criticism or disagreement
the adverb "just" in various contexts
Negative Logits
lore         -0.72
eers         -0.63
ahime        -0.59
Nurs         -0.58
Archdemon    -0.57
apolis       -0.57
appell       -0.56
pora         -0.56
nation       -0.55
ught         -0.54
Positive Logits
ifications   1.16
ifiable      1.11
cause        0.88
itia         0.88
because      0.87
because      0.86
if           0.84
ifying       0.81
inian        0.78
desserts     0.78
Activations Density: 0.067%