Explanations
words related to governmental policies and procedures
Negative Logits
oranges     -0.91
Balls       -0.75
aterasu     -0.72
Hera        -0.70
peppers     -0.70
Tears       -0.69
TAMADRA     -0.69
oldemort    -0.69
swe         -0.67
Dracula     -0.66
Positive Logits
occupational     0.89
allied           0.88
technical        0.88
sonian           0.88
aerospace        0.88
environmental    0.82
interpersonal    0.81
ultural          0.81
ural             0.81
chemical         0.79
Activations Density: 2.036%