INDEX
Explanations
phrases that indicate expectations and standards of quality
New Auto-Interp
Negative Logits
ajas
-0.14
$MESS
-0.13
omp
-0.13
ÙĬÙĦÙĬ
-0.13
ensen
-0.13
à¥Ĥड
-0.13
Unauthorized
-0.13
rej
-0.13
panor
-0.13
ords
-0.13
POSITIVE LOGITS
perfect
0.83
perfect
0.70
Perfect
0.69
PERF
0.67
Perfect
0.63
ideal
0.59
prefect
0.56
perfection
0.54
ideal
0.52
Ideal
0.49
Activations Density 0.119%