INDEX
    Explanations

    phrases indicating a basis for reasoning or conclusions

    New Auto-Interp
    Negative Logits
    -0.74
    TagHelper
    -0.61
     accesorios
    -0.60
    bravo
    -0.59
     MonoBehaviour
    -0.56
     utafitiHapana
    -0.56
    JvmStatic
    -0.55
     acessórios
    -0.54
    Tikang
    -0.53
    archiviato
    -0.52
    POSITIVE LOGITS
    Based
    0.72
     beruht
    0.71
     Based
    0.69
    ">/
    0.68
    BASED
    0.67
    dasarkan
    0.67
     tiré
    0.67
    基于
    0.66
    に基
    0.65
     based
    0.64
    Act Density 0.358%

    No Known Activations