INDEX
Explanations
proper nouns, possibly related to controversies or conflicts
mentions of specific names and terms related to people or entities in context
New Auto-Interp
Negative Logits
esthetic
-0.98
ily
-0.83
esthesia
-0.82
meric
-0.80
eco
-0.75
ijk
-0.73
iland
-0.73
erm
-0.72
ilic
-0.72
java
-0.72
POSITIVE LOGITS
aneous
0.96
Cumber
0.84
Lauder
0.81
hyde
0.80
aire
0.77
batch
0.76
ative
0.75
ror
0.70
FORE
0.70
aneously
0.69
Activations Density 0.190%