INDEX
Explanations
words relating to communication or expression, especially emphasizing the act of conveying messages or meaning
language related to conveying messages or meanings
New Auto-Interp
Negative Logits
ppo
-0.73
CRC
-0.70
udo
-0.70
olicy
-0.69
Patch
-0.67
Alliance
-0.65
Kier
-0.65
rance
-0.65
itsu
-0.64
BUG
-0.64
POSITIVE LOGITS
convey
3.86
conve
2.40
conveyed
2.39
express
1.29
relay
1.07
communicate
1.01
communicates
1.00
impart
0.98
communicating
0.97
communicated
0.96
Activations Density 0.032%