INDEX
    Explanations

    phrases that denote foundational principles or claims

    New Auto-Interp
    Negative Logits
    RegressionTest
    -0.65
    ofollow
    -0.57
    mphony
    -0.57
    nezeu
    -0.55
     initComponents
    -0.55
    ContentAsync
    -0.55
    -0.55
     giveaways
    -0.54
    PullParser
    -0.53
     TAGS
    -0.52
    POSITIVE LOGITS
     beruht
    0.59
     basado
    0.52
     basada
    0.49
     based
    0.49
     base
    0.48
     baseado
    0.48
     rely
    0.46
     basadas
    0.46
     basis
    0.45
    base
    0.44
    Act Density 0.032%

    No Known Activations