INDEX
    Explanations

    references to familial relationships and personal history

    New Auto-Interp
    Negative Logits
     noDo
    -0.57
    avajillas
    -0.57
    <bos>
    -0.46
    ganu
    -0.45
     rå
    -0.44
     (!_
    -0.43
     or
    -0.42
     Mather
    -0.42
    seers
    -0.42
    rimir
    -0.41
    POSITIVE LOGITS
    ScopeManager
    0.72
     تانيه
    0.71
     pinulongan
    0.71
    
    0.69
    UnitTesting
    0.68
    SBATCH
    0.68
    ollectionView
    0.66
     صوتيه
    0.65
    Попис
    0.64
    contentLoaded
    0.62
    Act Density 0.081%

    No Known Activations