INDEX
Explanations
words indicating additional information or referencing further reading material
occurrences of the term "further" along with references to additional information
New Auto-Interp
Negative Logits
hers
-0.74
Ïī
-0.67
theirs
-0.66
snatch
-0.65
paddle
-0.64
iren
-0.64
endered
-0.62
slack
-0.62
fishing
-0.61
itably
-0.61
POSITIVE LOGITS
Examples
1.03
Locations
1.00
Reports
0.97
Stories
0.93
Quote
0.93
Advice
0.93
Problems
0.93
Issues
0.92
Thoughts
0.92
Analysis
0.92
Activations Density 0.071%