INDEX
Explanations
instances of reporting or narration related to personal stories or experiences
Negative Logits
nomine      -0.15
اÙħØ©       -0.14
enth        -0.14
viso        -0.14
usp         -0.13
iner        -0.13
Stranger    -0.13
.grpc       -0.13
ÅŁa         -0.13
ipse        -0.13
Positive Logits
joining       0.24
joins         0.22
Join          0.22
Join          0.20
joining       0.20
Transcript    0.20
reporting     0.20
join          0.19
join          0.18
Reporting     0.17
Activations Density: 0.028%