INDEX
Explanations
mentions indicating responsibility or action
instances of empty or missing content
Negative Logits
millenn         -1.19
Instr           -0.98
govtrack        -0.97
enthusi         -0.92
iosyn           -0.86
aditional       -0.79
comr            -0.78
ikuman          -0.77
DragonMagazine  -0.76
luaj            -0.76
Positive Logits
<|endoftext|>                     1.06
ignty                             0.91
pmwiki                            0.84
ð                                 0.84
etsk                              0.84
©¶æ                               0.82
rawdownloadcloneembedreportprint  0.81
ebook                             0.79
wcsstore                          0.78
namese                            0.77
Activations Density: 0.782%