INDEX
Explanations
prepositions or conjunctions used to introduce explanations or references
phrases indicating contexts of public discussion or discourse
New Auto-Interp
Negative Logits
hov
-0.87
romeda
-0.80
etheless
-0.75
BSD
-0.71
frog
-0.71
ochet
-0.71
issance
-0.70
dor
-0.69
oshi
-0.67
rolet
-0.66
POSITIVE LOGITS
interviews
1.33
speeches
1.30
forums
1.29
conversations
1.17
polite
1.12
debates
1.11
contexts
1.11
conversation
1.09
public
1.08
presentations
1.06
Activations Density 0.348%