INDEX
Explanations
repeated use of the word "same" in texts
instances of the phrase "do the same."
New Auto-Interp
Negative Logits
mates
-0.72
Provided
-0.71
Lauder
-0.66
UST
-0.66
Built
-0.63
mate
-0.61
Leilan
-0.61
Advis
-0.60
Reborn
-0.60
Frag
-0.59
POSITIVE LOGITS
thing
1.09
same
0.85
vein
0.83
amount
0.74
exact
0.71
worldly
0.70
happen
0.70
deed
0.69
ials
0.68
aldi
0.68
Activations Density 0.022%