INDEX
Explanations
phrases instructing to share or email content
references to the word "this."
New Auto-Interp
Negative Logits
aneers
-0.69
alties
-0.69
ا
-0.68
owers
-0.68
cases
-0.67
Examples
-0.66
roots
-0.64
ono
-0.63
ceilings
-0.63
RESULTS
-0.62
POSITIVE LOGITS
article
1.10
item
0.94
slideshow
0.84
page
0.83
ARTICLE
0.83
document
0.82
site
0.81
webpage
0.79
article
0.78
essay
0.75
Activations Density 0.075%