INDEX
    Explanations

    proper nouns, especially related to locations and people

    significant entities, such as locations, people, and organizations mentioned in news or reports

    New Auto-Interp
    Negative Logits
     =================================================================
    -0.58
    ================================================================
    -0.57
     Canaver
    -0.55
    Dialogue
    -0.54
     Tolkien
    -0.53
     Loft
    -0.52
     Seah
    -0.50
     Historic
    -0.50
    .).
    -0.49
    '.
    -0.49
    POSITIVE LOGITS
    omach
    0.46
     routed
    0.45
    ÃĥÃĤ
    0.45
    emale
    0.44
    physical
    0.44
    ecause
    0.43
    Ö¼
    0.43
     physically
    0.42
     overpower
    0.42
    utterstock
    0.41
    Act Density 2.210%

    No Known Activations