    Explanations

    proper nouns and specific terms in various contexts

    Hindi, Japanese, technical abbreviations

    Negative Logits
    "UnusedPrivate"     -0.81
    " resourceCulture"  -0.76
    "genodigd"          -0.70
    "ロウィン"          -0.69
    "новништво"         -0.68
    " StyleSheet"       -0.67
    " Walkover"         -0.66
    " للاسماء"          -0.64
    " ViewPager"        -0.63
    " endwhile"         -0.63
    Positive Logits
    " यह"       0.82
    " یہ"       0.73
    "यह"        0.70
    " वह"       0.65
    " ये"        0.62
    " وہ"       0.55
    " वो"        0.50
    " जो"        0.44
    " वे"        0.42
    "これが"    0.39
    Activation Density: 0.004%

    No Known Activations