INDEX
Explanations
proper nouns and verbs related to news and events
content related to disciplinary actions and accusations
Negative Logits
Token            Logit
ntil             -0.61
soDeliveryDate   -0.61
bilt             -0.57
MpServer         -0.55
¶ħ               -0.53
Slate            -0.52
DragonMagazine   -0.51
ãĤ¨ãĥ«           -0.51
<?               -0.50
quote            -0.50
Positive Logits
Token            Logit
arrives          0.62
unexpectedly     0.61
enters           0.59
mysteriously     0.59
arrived          0.58
erupted          0.58
became           0.57
becomes          0.57
began            0.56
came             0.56
Activations Density: 1.002%
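
Two of the negative-logit tokens listed above (¶ħ and ãĤ¨ãĥ«) look like byte-level BPE token strings rather than encoding damage. The sketch below assumes the vocabulary uses the GPT-2 style byte-to-unicode table; the page does not name the tokenizer, so that is an assumption. The helper names `bytes_to_unicode_table` and `token_to_bytes` are hypothetical and exist only for this illustration.

```python
def bytes_to_unicode_table():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    printable = set(
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    table = {}
    offset = 0
    for b in range(256):
        if b in printable:
            table[b] = chr(b)
        else:
            # Non-printable bytes are shifted to code points starting at 256.
            table[b] = chr(256 + offset)
            offset += 1
    return table


# Inverse lookup: displayed character -> raw byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode_table().items()}


def token_to_bytes(token: str) -> bytes:
    """Convert a displayed token string back to its raw bytes."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


if __name__ == "__main__":
    for token in ["ãĤ¨ãĥ«", "¶ħ"]:
        raw = token_to_bytes(token)
        # errors="replace" because a token may hold only part of a UTF-8 sequence.
        print(repr(token), raw, raw.decode("utf-8", errors="replace"))
```

Under that assumption, ãĤ¨ãĥ« corresponds to the bytes E3 82 A8 E3 83 AB, which is the UTF-8 encoding of the katakana エル, while ¶ħ corresponds to B6 85, a fragment of a multi-byte sequence that does not decode on its own.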