INDEX
Explanations
phrases related to specific events or actions in a structured format akin to news headlines
references to specific events or actions tied to military or investigative themes
New Auto-Interp
Negative Logits
âĢ
-0.76
âĢ
-0.73
ãĢį
-0.73
ãĢ
-0.70
AMY
-0.69
--------
-0.68
ðŁij
-0.66
âĿ
-0.65
another
-0.63
ô
-0.63
POSITIVE LOGITS
brut
0.64
landfill
0.60
rubbish
0.56
outlandish
0.55
cramped
0.55
impractical
0.54
660
0.54
genitals
0.53
brute
0.52
postage
0.52
Activations Density 1.282%