INDEX
Explanations
instances of the word "same" or variations thereof
same followed by qualifier
New Auto-Interp
Negative Logits
popo
-0.47
PDC
-0.47
Uro
-0.46
hackers
-0.45
palanca
-0.44
povol
-0.44
PPC
-0.44
κος
-0.43
BPI
-0.43
fptr
-0.43
POSITIVE LOGITS
same
1.33
Same
1.32
Same
1.23
same
1.20
SAME
1.08
SAME
1.07
zelfde
0.92
isSame
0.88
mesmas
0.85
samma
0.81
Activations Density 0.040%