INDEX
Explanations
sentences indicating a cause-and-effect relationship or an outcome following an action
references to a statement, idea, or assertion
New Auto-Interp
Negative Logits
aths
-0.73
velop
-0.73
ickets
-0.71
adle
-0.70
amia
-0.69
tones
-0.68
utical
-0.67
ricanes
-0.66
affles
-0.66
endants
-0.66
POSITIVE LOGITS
contrasts
0.94
culminated
0.92
article
0.91
week
0.89
discrepancy
0.89
includes
0.88
begs
0.87
means
0.86
slideshow
0.86
isn
0.85
Activations Density 0.151%