INDEX
Explanations
words or phrases indicating a choice between two options
references to binary or dual concepts
New Auto-Interp
Negative Logits
MSN
-0.76
matter
-0.72
actionDate
-0.68
vic
-0.68
Unt
-0.68
utsche
-0.66
pat
-0.64
zee
-0.64
shed
-0.63
Spectre
-0.63
POSITIVE LOGITS
dozen
0.76
factors
0.74
quir
0.70
options
0.68
teenth
0.68
reasons
0.67
choices
0.65
finalists
0.65
hemisphere
0.65
arios
0.65
Activations Density 0.048%