INDEX
Explanations
the word "it" followed by a verb
the negation of actions or statements
Negative Logits
Skydragon       -0.70
arsen           -0.70
izons           -0.69
Maiden          -0.64
Houses          -0.63
Ĥ¬              -0.61
Nile            -0.59
Presence        -0.58
Koen            -0.58
izont           -0.58
Positive Logits
ogether         0.93
actly           0.92
ember           0.86
¨               0.86
withstanding    0.82
prise           0.76
necessarily     0.76
ymes            0.75
surpr           0.70
ournament       0.70
Activations Density: 0.091%