INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     coup
    -0.08
     Colony
    -0.07
    208
    -0.06
     saldırı
    -0.06
    _winner
    -0.06
     düzen
    -0.06
     victory
    -0.06
     divers
    -0.06
    function
    -0.06
     fort
    -0.06
    POSITIVE LOGITS
     read
    0.14
     Read
    0.13
    -read
    0.12
     reading
    0.12
     reads
    0.11
     READ
    0.10
     Reads
    0.10
     readings
    0.10
    read
    0.10
    	read
    0.09
    Act Density 0.055%

    No Known Activations