INDEX
Explanations
numerical quantities or comparatives followed by a verb
instances of the word "more" indicating significant quantities or increases
New Auto-Interp
Negative Logits
ĺħ
-0.83
xtap
-0.82
Conclusion
-0.75
imaru
-0.73
Fram
-0.72
ÃŁ
-0.72
ishops
-0.70
Quote
-0.69
Guard
-0.68
Practices
-0.67
POSITIVE LOGITS
than
1.28
stringent
0.94
importantly
0.85
sophisticated
0.84
than
0.82
Than
0.81
detailed
0.80
expensive
0.79
comprehensive
0.78
ado
0.78
Activations Density 0.139%