INDEX
Explanations
phrases related to geographic locations
references to specific names, particularly related to places or brands
Negative Logits
Butterfly    -0.76
forth        -0.75
Covenant     -0.70
Ashton       -0.68
WHO          -0.68
Beyond       -0.67
Fever        -0.67
Juliet       -0.66
Karin        -0.66
Continued    -0.65
Positive Logits
terior       0.99
aiman        0.96
iple         0.93
monary       0.93
umn          0.91
gur          0.90
uxe          0.88
ftime        0.87
uge          0.84
aimon        0.84
Activations Density: 0.030%