INDEX
Explanations
entities like names, titles or organizations in a specific format
punctuation marks and their contexts within citations and quotations
New Auto-Interp
Negative Logits
lling
-0.74
bably
-0.69
arer
-0.67
sulph
-0.65
inadequ
-0.63
forgotten
-0.63
lifes
-0.63
theless
-0.63
oeuv
-0.63
concentration
-0.63
POSITIVE LOGITS
BRE
0.93
KT
0.90
FOX
0.89
SCP
0.88
................................................................
0.86
Anonymous
0.86
WF
0.86
Screenshot
0.85
WARNING
0.85
GREEN
0.83
Activations Density 0.131%