Explanations
numeric values and variable identifiers commonly used in programming or mathematical expressions
Negative Logits
`]              -0.49
Y               -0.49
                -0.48
class           -0.47
company         -0.47
sex             -0.46
ixx             -0.46
yki             -0.46
L               -0.45
]}>             -0.44
Positive Logits
Jurí            0.84
myſelf          0.82
Normdatei       0.79
ItemBackground  0.75
pleaſure        0.74
Theſe           0.74
IsMutable       0.74
himſelf         0.73
těte            0.73
purpoſe         0.73
Activations Density: 0.823%