Explanations
phrases related to communication or contact between individuals
references to contact and communication with individuals
Negative Logits
rawdownloadcloneembedreportprint    -0.76
Temperature                         -0.65
ILCS                                -0.64
dominates                           -0.60
Execution                           -0.60
Tsukuyomi                           -0.57
edge                                -0.56
consumed                            -0.55
eaten                               -0.54
Category                            -0.54
Positive Logits
asking             1.09
requesting         1.07
regarding          0.96
privately          0.94
via                0.93
urgently           0.92
directly           0.91
inquire            0.89
anonymously        0.89
inappropriately    0.88
Activations Density 0.143%
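
As a rough illustration of the density figure above, the sketch below assumes that "activations density" means the fraction of token positions on which the feature fires at all. That definition, the function name `activation_density`, the `threshold` parameter, and the sample values are all assumptions made for illustration; they are not taken from the page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density counts a position as active when its activation is
    strictly greater than `threshold` (0.0 by default).
    """
    if len(activations) == 0:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for seven token positions; only one is non-zero.
sample = [0.0, 0.0, 0.0, 2.5, 0.0, 0.0, 0.0]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.143% would correspond to the feature being active on roughly 1 in 700 token positions.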