Explanations
proper nouns and organizations
combinations of punctuation and specific words, indicating a focus on lists or enumerations
Negative Logits
nih         -0.74
osc         -0.71
anya        -0.65
lift        -0.64
ocalypse    -0.64
anski       -0.64
animate     -0.62
oscopic     -0.62
aven        -0.61
ãĥ¥         -0.60
Positive Logits
meanwhile       1.07
flanked         0.88
accompanied     0.83
however         0.82
citing          0.82
alas            0.81
fearing         0.81
backed          0.81
unsurprisingly  0.78
albeit          0.73
Activations Density: 0.291%