INDEX
Explanations
proper nouns, especially related to locations and people
significant entities, such as locations, people, and organizations mentioned in news or reports
New Auto-Interp
Negative Logits
=================================================================
-0.58
================================================================
-0.57
Canaver
-0.55
Dialogue
-0.54
Tolkien
-0.53
Loft
-0.52
Seah
-0.50
Historic
-0.50
.).
-0.49
'.
-0.49
POSITIVE LOGITS
omach
0.46
routed
0.45
ÃĥÃĤ
0.45
emale
0.44
physical
0.44
ecause
0.43
Ö¼
0.43
physically
0.42
overpower
0.42
utterstock
0.41
Activations Density 2.210%