INDEX
Explanations
phrases indicating anticipation or future events
Negative Logits
Token        Logit
åĮ           -0.18
anon         -0.18
comes        -0.15
lluminate    -0.15
come         -0.15
anter        -0.15
uce          -0.15
ÏĦÏİ         -0.15
416          -0.14
goes         -0.14
Positive Logits
Token        Logit
com          0.44
-com         0.44
com          0.43
COM          0.41
Com          0.38
_com         0.36
comm         0.35
kom          0.34
coming       0.34
Coming       0.33
Activations Density: 0.044%
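
Two entries in the negative logits list, `åĮ` and `ÏĦÏİ`, look like byte-level BPE token strings rather than readable text. The sketch below shows one way to decode such strings, assuming the underlying model uses a GPT-2-style byte-to-unicode mapping; the source does not name the tokenizer, so this is an assumption. The helper name `decode_token` is hypothetical and used only for illustration.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2-style table that maps each byte to a printable character."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Bytes without a printable form are shifted past U+00FF.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Hypothetical helper: turn a displayed token string back into text.

    Assumes the token was rendered with the GPT-2 byte-to-unicode table.
    Incomplete UTF-8 sequences are shown as U+FFFD instead of raising.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ÏĦÏİ", "åĮ"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ÏĦÏİ` corresponds to the bytes `CF 84 CF 8E`, which is the Greek string `τώ`, while `åĮ` corresponds to `E5 8C`, the first two bytes of a three-byte UTF-8 character, so it cannot be decoded on its own.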