INDEX
Explanations
logical reasoning and related concepts
references to logic and reasoning
New Auto-Interp
Negative Logits
avez
-0.72
Volunte
-0.70
Ago
-0.66
enegger
-0.66
Settlement
-0.66
elcome
-0.64
hold
-0.64
Veterans
-0.64
Banner
-0.63
eligible
-0.63
POSITIVE LOGITS
logic
0.96
dictates
0.95
underpin
0.94
reasoning
0.92
matical
0.92
fallacy
0.90
justifying
0.86
logically
0.82
analy
0.82
istically
0.80
Activations Density 0.046%