INDEX
    Explanations

    references to specific places or locations

    words related to specific names or entities, particularly proper nouns

    New Auto-Interp
    Negative Logits
    WAR
    -0.72
    gage
    -0.72
    kies
    -0.70
    dies
    -0.70
    venge
    -0.67
    CAP
    -0.66
    gard
    -0.66
    MET
    -0.65
    sea
    -0.65
     è£ı
    -0.65
    POSITIVE LOGITS
    imb
    1.13
    odies
    0.91
    ilib
    0.84
    oche
    0.81
    ead
    0.79
    orts
    0.78
    uilt
    0.78
    ospital
    0.77
    edience
    0.76
    amate
    0.75
    Act Density 0.012%

    No Known Activations