INDEX
    Explanations

    company names in text

    negations or statements disproving a claim

    New Auto-Interp
    Negative Logits
     Palest
    -0.71
     photoc
    -0.67
    çīĪ
    -0.65
    ilst
    -0.62
     indo
    -0.62
     sacrific
    -0.61
     cryptoc
    -0.61
     Fukushima
    -0.60
     Nik
    -0.60
     mainland
    -0.59
    POSITIVE LOGITS
    ¬
    1.32
    Ń
    1.27
    ±
    1.20
    «
    1.18
    ĵ
    1.16
    ĸ
    1.15
    ij
    1.14
    ª
    1.13
    £
    1.12
    Ķ
    1.12
    Act Density 0.136%

    No Known Activations