INDEX
Explanations
proper nouns
mentions of specific names and entities, particularly individuals and terms related to programming
New Auto-Interp
Negative Logits
opathic
-0.70
ende
-0.69
ãĤµ
-0.66
enhagen
-0.65
©¶æ
-0.65
vernment
-0.65
nels
-0.63
ressor
-0.63
CVE
-0.63
indo
-0.62
POSITIVE LOGITS
Leah
0.83
ighting
0.83
iva
0.82
ÃŃn
0.77
ua
0.77
y
0.74
Mess
0.73
LM
0.73
prints
0.71
borgh
0.70
Activations Density 0.017%