INDEX
Explanations
phrases that present contrasting information or examples
conjunctions and transitional phrases suggesting relationships between ideas
New Auto-Interp
Negative Logits
boro
-0.62
okane
-0.58
doorway
-0.56
clipboard
-0.55
lockdown
-0.54
wordpress
-0.53
ideshow
-0.52
spectator
-0.52
facade
-0.50
picnic
-0.50
POSITIVE LOGITS
however
1.14
furthermore
1.12
moreover
1.07
therefore
1.01
Therefore
1.00
Moreover
0.99
Specifically
0.96
Furthermore
0.92
However
0.83
Specifically
0.81
Activations Density 0.862%