INDEX
Explanations
sentences related to discussing the methodology or assumptions of logical reasoning and academic arguments
New Auto-Interp
Negative Logits
Boot
-0.64
Inher
-0.64
dylib
-0.63
hail
-0.61
Strikes
-0.60
Jur
-0.60
Ital
-0.60
ebted
-0.60
Kut
-0.59
watches
-0.59
POSITIVE LOGITS
preferable
1.11
counterproductive
1.11
fraught
1.08
advisable
0.99
frowned
0.95
futile
0.95
problematic
0.94
impractical
0.94
folly
0.91
daunting
0.90
Activations Density 2.503%