INDEX
Explanations
specific names or entities mentioned in the text
references to specific individuals involved in a narrative or incident
New Auto-Interp
Negative Logits
Ranked
-0.79
wikipedia
-0.73
aternity
-0.72
duc
-0.70
naire
-0.69
mount
-0.69
çĦ
-0.68
ĸļ
-0.68
raising
-0.66
ebook
-0.65
POSITIVE LOGITS
Toro
1.16
Strait
0.80
ettel
0.75
ulla
0.75
Elliot
0.73
Lumpur
0.73
inez
0.73
peppers
0.73
-------
0.72
IELD
0.70
Activations Density 0.011%