INDEX
Explanations
phrases related to contrasting statements or pieces of information
the word "and" used in a variety of contexts to connect phrases or ideas
Negative Logits
ses      -0.70
Fine     -0.70
Limited  -0.67
":-      -0.67
Glass    -0.66
Hide     -0.64
AU       -0.64
Ur       -0.63
asper    -0.63
kell     -0.63
Positive Logits
romeda        0.91
consequently  0.90
preferably    0.89
possibly      0.88
hopefully     0.87
vice          0.87
therefore     0.87
rightly       0.86
optionally    0.86
rightfully    0.84
Activations Density: 0.083%