INDEX
Explanations
conclusive statements or transitions in reasoning
introducing a conclusion
New Auto-Interp
Negative Logits
anteen
-0.57
erview
-0.52
Badge
-0.52
Bowling
-0.52
Ventilation
-0.51
Lint
-0.50
Auditor
-0.50
⒝
-0.50
ſeine
-0.50
telemetry
-0.49
POSITIVE LOGITS
Thus
1.39
Thus
1.37
Ainsi
0.89
Ainsi
0.87
Hence
0.76
Hence
0.76
Therefore
0.73
Therefore
0.71
thus
0.62
thus
0.60
Activations Density 0.016%