INDEX
Explanations
phrases indicating emphasis or importance
the phrase "that," indicating emphasis on specific points or references in the text
Negative Logits
apolis    -0.72
istics    -0.67
YS        -0.67
heid      -0.67
tones     -0.66
izont     -0.64
istor     -0.61
emis      -0.61
ington    -0.61
ophers    -0.61
Positive Logits
includes       1.02
translates     0.99
leads          0.95
culminated     0.94
happens        0.90
entails        0.90
constitutes    0.89
resulted       0.88
pesky          0.88
manifests      0.87
Activations Density: 0.118%