INDEX
Explanations
parentheses and numbers, especially when used in a legal context
occurrences of parentheses or other bracketed groupings
New Auto-Interp
Negative Logits
utenberg
-0.65
hope
-0.58
emis
-0.55
ÂŃ
-0.54
¬¼
-0.54
fears
-0.53
tongues
-0.51
uca
-0.51
fear
-0.51
joking
-0.50
POSITIVE LOGITS
(
3.18
((
2.40
(-
2.34
([
2.23
(*
2.22
(&
2.17
(_
2.15
("2.14
($
2.13
({1.96
Activations Density 0.030%