INDEX
Explanations
mentions of political figures and entities
special characters or symbols used in various contexts
New Auto-Interp
Negative Logits
guiName
-0.87
Solitaire
-0.76
theless
-0.75
etheless
-0.71
ACTIONS
-0.65
Waves
-0.57
Shroud
-0.57
scattering
-0.56
Ruin
-0.56
Skydragon
-0.56
POSITIVE LOGITS
*)
1.54
>)
1.54
.).
1.49
!).
1.44
).
1.42
!)
1.41
?).
1.40
!),
1.38
)."
1.37
)!
1.35
Activations Density 0.054%