INDEX
    Explanations

    "the" followed by abstract nouns

    NEGATIVE LOGITS
    "VIS"             0.41
    "Determin"        0.39
    "真實"            0.39
    " რამდ"           0.39
    "真实"            0.38
    "šia"             0.38
    " радиа"          0.38
    " bárm"           0.38
    "whats"           0.37
    " نوشت"           0.37

    POSITIVE LOGITS
    " havoc"          0.74
    " impression"     0.70
    " sacrifices"     0.63
    " compromises"    0.63
    " impressions"    0.62
    " manner"         0.62
    " way"            0.61
    " battles"        0.60
    " lengths"        0.59
    " inroads"        0.59

    Activation Density: 0.028%

    No Known Activations