INDEX
Explanations
efficiency and quality metrics
Negative Logits
affectionate    0.46
intensa         0.43
Resistant       0.42
indulgent       0.41
intolerant      0.40
permeable       0.40
intenso         0.40
tolerant        0.39
arrogant        0.39
incompetent     0.39
Positive Logits
reliability     2.14
stability       1.91
efficiency      1.90
accuracy        1.86
usability       1.86
reliability     1.82
readability     1.79
effectiveness   1.73
correctness     1.71
clarity         1.66
Activations Density 0.570%