INDEX
Explanations
phrases related to making official statements
statements made by individuals in formal contexts
Negative Logits
Token          Logit
penetration    -0.70
cius           -0.69
liest          -0.69
wat            -0.63
impro          -0.63
mismatch       -0.63
worm           -0.63
atron          -0.62
Penet          -0.61
eyed           -0.61
Positive Logits
Token          Logit
Statement      1.05
"…             0.82
"...           0.81
THANK          0.77
apology        0.77
"#             0.76
:"             0.76
"@             0.74
apologizing    0.74
statement      0.74
Activations Density 0.339%