Explanations
mentions of the word "Great," particularly in the context of notable places or organizations
Negative Logits
ilk        -0.19
ekk        -0.17
lov        -0.17
othermal   -0.16
รà¸ģ       -0.16
ç          -0.15
ابة        -0.15
ILA        -0.15
ehr        -0.15
iew        -0.14
Positive Logits
orex       0.27
atsby      0.24
Britain    0.23
Lakes      0.23
ness       0.23
rex        0.21
gatsby     0.21
Expect     0.21
Barrier    0.20
_expect    0.20
Activations Density: 0.019%