INDEX
Explanations
phrases containing the word "paper"
occurrences of the word "paper" and its variants
New Auto-Interp
Negative Logits
ß
-0.69
tery
-0.67
otions
-0.65
fitting
-0.65
amping
-0.64
Christie
-0.62
inately
-0.62
otomy
-0.61
ting
-0.61
kw
-0.61
POSITIVE LOGITS
clips
1.01
ILY
0.89
apers
0.87
formance
0.82
aper
0.82
clip
0.74
ONS
0.70
strip
0.68
enegger
0.68
IUM
0.67
Activations Density 0.047%