INDEX
Explanations
details related to numerical specifications or instructions
Negative Logits
ãĥį          -0.82
lyn          -0.81
roo          -0.74
ãĥ¯          -0.71
Ron          -0.71
without      -0.67
quished      -0.67
Susan        -0.66
umerable     -0.65
ļéĨĴ         -0.64
Positive Logits
severity       0.95
circumstances  0.86
circumstance   0.85
uncture        0.76
type           0.73
subclass       0.72
luck           0.71
recipient      0.69
disposition    0.69
scenario       0.69
Activations Density: 0.107%