INDEX
Explanations
specific numbers or numerical patterns within the text
New Auto-Interp
Negative Logits
-0.75
↵↵↵↵↵
-0.72
Madura
-0.72
PROM
-0.69
Moc
-0.68
scrollPane
-0.68
◗
-0.68
CDP
-0.67
virt
-0.67
Kasper
-0.66
POSITIVE LOGITS
1
0.91
9
0.84
UNRELATED
0.81
eleven
0.79
7
0.77
6
0.77
nakalista
0.77
5
0.76
lys
0.73
eleven
0.72
Activations Density 0.182%