INDEX
Explanations
transitional phrases indicating progression or movement in a sequence or discussion
transitions in topics or sections of text
Negative Logits
ãĥ´        -0.69
boro       -0.67
ardless    -0.66
sov        -0.65
ovan       -0.64
aren       -0.63
unta       -0.61
+(         -0.61
isle       -0.61
arta       -0.60
Positive Logits
basics        0.90
Conclusion    0.86
â̦]          0.82
specifics     0.77
topic         0.76
whats         0.74
juicy         0.74
topic         0.73
explan        0.72
Discussion    0.71
Activations Density 0.314%
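The density figure above is most naturally read as the share of token positions on which the feature fires at all. The sketch below illustrates that reading under stated assumptions: the page does not say how the value was computed, the function name `activation_density` and the zero threshold are hypothetical, and the toy data is invented purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature is active.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 3 of which activate the feature.
toy = [0.0] * 997 + [1.2, 0.4, 2.5]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.314% would mean the feature is active on roughly 3 of every 1,000 tokens in the sampled text.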