INDEX
Explanations
French language text and words.
words related to war, literature, and industry
phrases related to societal critique or observation
Negative Logits
Token       Logit
Arizona     -0.90
baugh       -0.87
Mariners    -0.79
Hispanic    -0.77
Seattle     -0.77
arijuana    -0.75
Uzbek       -0.75
Redd        -0.75
amsung      -0.75
antha       -0.74
Positive Logits
Token       Logit
Ré          1.47
É           1.40
à           1.39
Franç       1.38
France      1.37
François    1.35
Qué         1.33
Rouge       1.32
Hollande    1.31
Paris       1.31
Activations Density 0.502%
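
As a rough illustration of what a density figure like the one above may measure, the sketch below computes the fraction of token positions on which a feature's activation is non-zero. This assumes that "activations density" means the share of sampled tokens where the feature fires; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1,000 token positions, 5 of which are active.
sample = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, a density of 0.502% would mean the feature fires on roughly 5 of every 1,000 sampled tokens.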