INDEX
Explanations
characters formatting changes in the text
phrases indicating diplomatic or international relations contexts
New Auto-Interp
Negative Logits
Oaks
-0.69
hindsight
-0.65
homebrew
-0.63
Whale
-0.62
utilizing
-0.61
contemplating
-0.61
abouts
-0.61
Hancock
-0.60
obscurity
-0.59
concussion
-0.59
POSITIVE LOGITS
Shape
0.86
Egypt
0.86
Foreign
0.83
AFP
0.82
SPONSORED
0.81
Scroll
0.81
Turkey
0.80
Saudi
0.79
Turkish
0.78
PHOTOS
0.77
Activations Density 0.259%