INDEX
Explanations
instances where someone is explaining or noting something to others
instances where explanations and assertions are being made
New Auto-Interp
Negative Logits
Ther
-0.65
raq
-0.59
held
-0.56
peg
-0.56
respect
-0.55
icum
-0.54
bish
-0.54
backer
-0.53
aic
-0.53
Pont
-0.53
POSITIVE LOGITS
©¶æ
0.84
"â̦
0.83
"[
0.81
"...
0.81
-+-+
0.71
'[
0.70
"(
0.69
=\"
0.67
quoting
0.65
although
0.64
Activations Density 0.257%