Explanations
statements or opinions expressed in written form
phrases related to communication, dialogue, and the sharing of information
Negative Logits
ayne          -0.70
asus          -0.68
capacity      -0.62
audi          -0.62
berus         -0.61
ossus         -0.59
senal         -0.58
bidder        -0.57
Roses         -0.57
orest         -0.56
Positive Logits
about         1.54
ABOUT         1.46
about         1.17
About         1.14
aloud         1.14
anonymously   1.09
regarding     1.06
concerning    1.02
pertaining    0.98
orally        0.98
Activations Density: 0.563%