INDEX
Explanations
words indicating logical conclusions or implications
instances of the word "therefore."
New Auto-Interp
Negative Logits
Columb
-0.75
ten
-0.71
Feld
-0.70
Straw
-0.67
Blaz
-0.66
Ashton
-0.64
Lancaster
-0.63
jar
-0.62
Wooden
-0.62
Ventura
-0.61
POSITIVE LOGITS
ĵĺ
0.97
guiActiveUn
0.93
efficients
0.92
therefore
0.91
rehend
0.89
ratulations
0.84
unfocusedRange
0.84
ħĭ
0.84
occas
0.83
Ͻ
0.82
Activations Density 0.008%