INDEX
    Explanations

    references to TV shows and news programs

    New Auto-Interp
    Negative Logits
    tagext
    -0.49
     agu
    -0.48
     truncate
    -0.47
    λου
    -0.47
     Hald
    -0.46
    ..\..\
    -0.45
    بوابة
    -0.44
    úgó
    -0.44
    amazonaws
    -0.44
     telegraph
    -0.44
    POSITIVE LOGITS
    <bos>
    0.61
     transfieras
    0.61
     betweenstory
    0.59
    IndentedString
    0.54
    tagHelper
    0.53
     Italijani
    0.53
     Paglinawan
    0.52
     transcur
    0.51
    irchen
    0.50
    elemField
    0.49
    Act Density 0.611%

    No Known Activations