INDEX
Explanations
proper nouns, specifically names
New Auto-Interp
Negative Logits
DockStyle
-0.83
ArrowToggle
-0.73
initComponents
-0.65
تضيفلها
-0.64
########.
-0.62
Dorothy
-0.62
Hilda
-0.61
Susan
-0.61
carol
-0.61
Dorothy
-0.61
POSITIVE LOGITS
Cade
0.79
Cade
0.73
Caleb
0.71
Luke
0.69
__))
0.69
Dylan
0.69
Kade
0.67
Drew
0.67
Ethan
0.66
Caleb
0.65
Activations Density 0.452%