INDEX
Explanations
contrasting statements that feature the word "but"
conjunctions that introduce contrasting or adversative statements
New Auto-Interp
Negative Logits
dayName
-0.73
abe
-0.71
\":
-0.71
AU
-0.70
onomy
-0.69
sylvania
-0.68
)|
-0.67
orts
-0.66
endar
-0.65
kamp
-0.64
POSITIVE LOGITS
tons
1.34
alas
1.22
chers
0.98
luckily
0.90
hey
0.90
somehow
0.88
fortunately
0.87
unfortunately
0.86
beware
0.85
mirac
0.85
Activations Density 0.279%