INDEX
Explanations
references to communication and responses
NEGATIVE LOGITS
ipple        -0.16
washer       -0.14
usters       -0.14
/Gate        -0.14
ĥģ           -0.13
ÅĻit         -0.13
scribe       -0.13
Robertson    -0.13
Bey          -0.13
ê¶ģ          -0.13
POSITIVE LOGITS
receive      0.79
receiving    0.73
received     0.73
receives     0.72
receipt      0.71
Receive      0.67
rece         0.65
receive      0.65
Rece         0.64
Received     0.63
Activations Density 0.153%