INDEX
    Explanations

    references to character attributes and relational dynamics within narratives or discussions

    Following "is," "are," or "were."

    New Auto-Interp
    Negative Logits
    SourceChecksum
    -0.47
    хьтан
    -0.45
    CreateMap
    -0.42
    toHaveBeenCalled
    -0.42
    OOTDTY
    -0.39
    dagog
    -0.36
    OGND
    -0.35
    المناصب
    -0.35
    UnknownFields
    -0.35
     LEYENDO
    -0.35
    POSITIVE LOGITS
     vielmehr
    0.65
    むしろ
    0.56
     instead
    0.54
    Instead
    0.54
    AnchorTagHelper
    0.52
     Instead
    0.52
    あく
    0.52
    あくまで
    0.48
    而是
    0.48
    instead
    0.47
    Act Density 0.442%

    No Known Activations