INDEX
Explanations
phrases and connectors that indicate complexity and nuance in discussions around social and institutional issues
New Auto-Interp
Negative Logits
leta
-0.14
aan
-0.14
abb
-0.14
eus
-0.13
ine
-0.13
275
-0.13
apis
-0.13
useClass
-0.13
TL
-0.13
rosse
-0.13
POSITIVE LOGITS
etc
0.39
etc
0.32
/etc
0.24
çŃī
0.20
ÑĤоÑīо
0.17
finally
0.17
whatever
0.16
blah
0.16
çŃī
0.16
iferay
0.16
Activations Density 0.076%