INDEX
Explanations
phrases related to loyalty or public representation
New Auto-Interp
Negative Logits
forth
-0.82
ppard
-0.72
stood
-0.67
alore
-0.66
spont
-0.64
Borders
-0.64
Genie
-0.62
Overse
-0.61
AFTA
-0.60
Stain
-0.60
POSITIVE LOGITS
actionDate
0.88
][
0.87
]
0.83
TEXT
0.77
"]=>
0.77
];
0.75
guiActive
0.72
guiActiveUnfocused
0.71
><
0.71
async
0.71
Activations Density 0.151%