Feature 0: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA (direct feature attribution) by source position
MAX = 1.768

been
Token been
Feature activation-0.321
proposing
Token proposing
Feature activation-0.372
don
Token don
Feature activation-0.095
't
Token't
Feature activation-0.516
reach
Token reach
Feature activation-2.965
,
Token,
Feature activation+0.188
are
Token are
Feature activation-0.403
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.574
perspective
Token perspective
Feature activation-0.177
.
Token.
Feature activation+0.007
"
Token "
Feature activation+0.224
What
TokenWhat
Feature activation-0.755
is
Token is
Feature activation-0.398
true
Token true
Feature activation-0.657
,
Token,
Feature activation+0.099
though
Token though
Feature activation-0.390
prescriptions
Token prescriptions
Feature activation+0.063
that
Token that
Feature activation-0.171
we
Token we
Feature activation-0.058
've
Token've
Feature activation-0.057
been
Token been
Feature activation-0.193
proposing
Token proposing
Feature activation+0.363
don
Token don
Feature activation-0.794
't
Token't
Feature activation-0.589
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.893
,
Token,
Feature activation-0.143
though
Token though
Feature activation-0.648
,
Token,
Feature activation-0.103
is
Token is
Feature activation-0.004
that
Token that
Feature activation+0.373
whatever
Token whatever
Feature activation-0.478
policy
Token policy
Feature activation-0.636
prescriptions
Token prescriptions
Feature activation-0.338
that
Token that
Feature activation-0.151
we
Token we
Feature activation-0.089
<|endoftext|>
Token<|endoftext|>
Feature activation-8.769
perspective
Token perspective
Feature activation-0.417
.
Token.
Feature activation-0.356
"
Token "
Feature activation+0.179
What
TokenWhat
Feature activation-0.218
is
Token is
Feature activation-0.170
true
Token true
Feature activation-0.218
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.992
perspective
Token perspective
Feature activation-0.416
.
Token.
Feature activation-0.189
"
Token "
Feature activation+1.085
What
TokenWhat
Feature activation-0.238
is
Token is
Feature activation-0.820
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.343
perspective
Token perspective
Feature activation-0.740
.
Token.
Feature activation+0.112
"
Token "
Feature activation+1.768
What
TokenWhat
Feature activation-1.242
is
Token is
Feature activation-0.944
true
Token true
Feature activation-1.373
,
Token,
Feature activation-0.135
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.342
.
Token.
Feature activation+0.082
"
Token "
Feature activation-1.477
What
TokenWhat
Feature activation-0.650
is
Token is
Feature activation-0.605
true
Token true
Feature activation+1.487
,
Token,
Feature activation-0.115
though
Token though
Feature activation-2.502
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.045
is
Token is
Feature activation-0.175
that
Token that
Feature activation-0.653
whatever
Token whatever
Feature activation-0.259
policy
Token policy
Feature activation-0.517
prescriptions
Token prescriptions
Feature activation+0.734
that
Token that
Feature activation-0.784
we
Token we
Feature activation-0.446
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.987
perspective
Token perspective
Feature activation-0.141
.
Token.
Feature activation-0.055
"
Token "
Feature activation+0.148
What
TokenWhat
Feature activation-0.049
is
Token is
Feature activation-0.207
true
Token true
Feature activation-0.462
,
Token,
Feature activation-0.006
though
Token though
Feature activation-0.525
<|endoftext|>
Token<|endoftext|>
Feature activation-8.515
perspective
Token perspective
Feature activation-0.200
.
Token.
Feature activation-0.146
"
Token "
Feature activation-0.049
What
TokenWhat
Feature activation+0.331
is
Token is
Feature activation-0.584
true
Token true
Feature activation-0.633
,
Token,
Feature activation+0.018
though
Token though
Feature activation-0.539
,
Token,
Feature activation-0.171
<|endoftext|>
Token<|endoftext|>
Feature activation-7.187
perspective
Token perspective
Feature activation-0.132
.
Token.
Feature activation-0.208
"
Token "
Feature activation-0.069
What
TokenWhat
Feature activation+0.036
is
Token is
Feature activation-0.437
true
Token true
Feature activation-0.789
,
Token,
Feature activation-0.096
though
Token though
Feature activation-0.734
,
Token,
Feature activation-0.261
,
Token,
Feature activation-0.083
is
Token is
Feature activation-0.060
that
Token that
Feature activation-0.528
whatever
Token whatever
Feature activation-0.235
policy
Token policy
Feature activation-0.488
prescriptions
Token prescriptions
Feature activation+0.939
that
Token that
Feature activation-0.530
we
Token we
Feature activation-0.301
've
Token've
Feature activation-0.323
been
Token been
Feature activation-0.830
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.113
is
Token is
Feature activation-0.079
that
Token that
Feature activation-0.532
whatever
Token whatever
Feature activation-0.250
policy
Token policy
Feature activation-0.314
prescriptions
Token prescriptions
Feature activation+1.184
that
Token that
Feature activation-0.689
we
Token we
Feature activation-0.344
've
Token've
Feature activation-0.635
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.130
is
Token is
Feature activation-0.232
that
Token that
Feature activation-0.886
whatever
Token whatever
Feature activation-0.582
policy
Token policy
Feature activation+0.127
prescriptions
Token prescriptions
Feature activation+0.600
that
Token that
Feature activation-0.480
we
Token we
Feature activation-0.300
've
Token've
Feature activation-0.065
been
Token been
Feature activation-0.203
proposing
Token proposing
Feature activation+0.592
whatever
Token whatever
Feature activation-0.281
policy
Token policy
Feature activation-0.119
prescriptions
Token prescriptions
Feature activation-0.363
that
Token that
Feature activation-0.102
we
Token we
Feature activation-0.028
've
Token've
Feature activation+0.082
been
Token been
Feature activation-0.046
proposing
Token proposing
Feature activation-0.072
don
Token don
Feature activation-4.815
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.877
perspective
Token perspective
Feature activation-0.129
.
Token.
Feature activation-0.005
"
Token "
Feature activation+0.238
What
TokenWhat
Feature activation-0.355
is
Token is
Feature activation-0.584
true
Token true
Feature activation-0.926
,
Token,
Feature activation-0.032
though
Token though
Feature activation-0.496
<|endoftext|>
Token<|endoftext|>
Feature activation-7.820
perspective
Token perspective
Feature activation-0.237
.
Token.
Feature activation-0.022
"
Token "
Feature activation+0.689
What
TokenWhat
Feature activation-0.568
is
Token is
Feature activation-0.445
true
Token true
Feature activation-0.685
,
Token,
Feature activation-0.166
though
Token though
Feature activation-0.748
perspective
Token perspective
Feature activation-0.442
.
Token.
Feature activation-0.146
"
Token "
Feature activation+0.022
What
TokenWhat
Feature activation-0.877
is
Token is
Feature activation-0.427
true
Token true
Feature activation+1.267
,
Token,
Feature activation-0.071
though
Token though
Feature activation-1.958
,
Token,
Feature activation-0.498
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.656
perspective
Token perspective
Feature activation-0.221
.
Token.
Feature activation-0.111
"
Token "
Feature activation+0.279
What
TokenWhat
Feature activation-0.405
is
Token is
Feature activation-0.319
true
Token true
Feature activation-1.061
,
Token,
Feature activation+0.077
though
Token though
Feature activation-1.251

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.07

Head 3: 0.08

Head 4: 0.10

Head 5: 0.07

Head 6: 0.09

Head 7: 0.08

Head 8: 0.09

Head 9: 0.09

Head 10: 0.08

Head 11: 0.08

Positive logits

"$:/ 3.20

Hispan 2.83

Rouse 2.83

verett 2.76

utenberg 2.76

cffffcc 2.73

Toast 2.73

redirect 2.70

alg 2.66

Everett 2.65

Merc 2.59

Tul 2.57

Merc 2.55

avorite 2.51

actionGroup 2.49

leans 2.49

engers 2.47

lest 2.47

redirected 2.46

~~~~ 2.45

Negative logits

イト -2.77

Ara -2.74

PC -2.73

mascul -2.73

Basketball -2.70

RPG -2.69

-2.66

lyak -2.58

エル -2.57

-2.57

-2.55

Ramadan -2.50

Ri -2.50

�� -2.48

-2.45

MJ -2.44

PC -2.37

leukemia -2.34

-2.34

-2.34

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

to
Token to
Feature activation+0.000
gain
Token gain
Feature activation+0.000
advantage
Token advantage
Feature activation+0.000
.
Token.
Feature activation+0.000
CH
Token CH
Feature activation+0.000
E
TokenE
Feature activation+0.000
AT
TokenAT
Feature activation+0.000
to
Token to
Feature activation+0.000
Improve
Token Improve
Feature activation+0.000
your
Token your
Feature activation+0.000
hand
Token hand
Feature activation+0.000
unt
Tokenunt
Feature activation+0.000
men
Tokenmen
Feature activation+0.000
]
Token]
Feature activation+0.000
train
Token train
Feature activation+0.000
for
Token for
Feature activation+0.000
months
Token months
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.000
sure
Token sure
Feature activation+0.000
that
Token that
Feature activation+0.000
every
Token every
Feature activation+0.000
factor
Token factor
Feature activation+0.000
Sunday
Token Sunday
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
Jaguars
Token Jaguars
Feature activation+0.000
were
Token were
Feature activation+0.000
in
Token in
Feature activation+0.000
it
Token it
Feature activation+0.000
.
Token.
Feature activation+0.000
When
Token When
Feature activation+0.000
he
Token he
Feature activation+0.000
and
Token and
Feature activation+0.000
potential
Token potential
Feature activation+0.000
fines
Token fines
Feature activation+0.000
stemming
Token stemming
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
2010
Token 2010
Feature activation+0.000
disaster
Token disaster
Feature activation+0.000
,
Token,
Feature activation+0.000
emergency
Token emergency
Feature activation+0.000
responders
Token responders
Feature activation+0.000
presumed
Token presumed
Feature activation+0.000
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
an
Token an
Feature activation+0.000
atheist
Token atheist
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
the
Token the
Feature activation+0.000
apparent
Token apparent
Feature activation+0.000
intent
Token intent
Feature activation+0.000
was
Token was
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 1: " Wilson" Name mover

TOP ACTIVATIONS
MAX = 5.589

The
Token The
Feature activation+0.000
fax
Token fax
Feature activation+0.000
indicated
Token indicated
Feature activation+4.475
it
Token it
Feature activation+0.000
was
Token was
Feature activation+3.399
suspected
Token suspected
Feature activation+5.589
that
Token that
Feature activation+4.017
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
impaired
Token impaired
Feature activation+0.000
by
Token by
Feature activation+0.000
from
Token from
Feature activation+3.743
the
Token the
Feature activation+0.161
medical
Token medical
Feature activation+0.000
cannabis
Token cannabis
Feature activation+0.000
,
Token,
Feature activation+3.036
terminating
Token terminating
Feature activation+5.563
Wilson
Token Wilson
Feature activation+0.000
's
Token's
Feature activation+0.000
employment
Token employment
Feature activation+0.000
.
Token.
Feature activation+3.574
Ċ
TokenĊ
Feature activation+1.196
's
Token's
Feature activation+0.000
example
Token example
Feature activation+0.000
re
Token re
Feature activation+0.000
gun
Token gun
Feature activation+0.000
laws
Token laws
Feature activation+0.000
,"
Token,"
Feature activation+4.762
she
Token she
Feature activation+0.000
tweeted
Token tweeted
Feature activation+0.000
Friday
Token Friday
Feature activation+0.000
.
Token.
Feature activation+1.695
Ċ
TokenĊ
Feature activation+0.000
to
Token to
Feature activation+2.098
the
Token the
Feature activation+0.000
record
Token record
Feature activation+0.000
,
Token,
Feature activation+2.072
as
Token as
Feature activation+2.907
did
Token did
Feature activation+4.669
complex
Token complex
Feature activation+0.000
orchestra
Token orchestra
Feature activation+0.000
numbers
Token numbers
Feature activation+0.546
played
Token played
Feature activation+0.000
by
Token by
Feature activation+2.834
paying
Token paying
Feature activation+3.976
players
Token players
Feature activation+0.000
at
Token at
Feature activation+0.000
USC
Token USC
Feature activation+0.000
.
Token.
Feature activation+2.744
But
Token But
Feature activation+4.561
it
Token it
Feature activation+0.000
came
Token came
Feature activation+0.000
out
Token out
Feature activation+0.000
so
Token so
Feature activation+1.532
fast
Token fast
Feature activation+4.539
azing
Tokenazing
Feature activation+0.000
Systems
Token Systems
Feature activation+0.000
.
Token.
Feature activation+1.889
The
Token The
Feature activation+0.000
fax
Token fax
Feature activation+0.000
indicated
Token indicated
Feature activation+4.475
it
Token it
Feature activation+0.000
was
Token was
Feature activation+3.399
suspected
Token suspected
Feature activation+5.589
that
Token that
Feature activation+4.017
he
Token he
Feature activation+0.000
weeks
Token weeks
Feature activation+0.000
.
Token.
Feature activation+2.127
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+2.990
That
TokenThat
Feature activation+0.000
leaves
Token leaves
Feature activation+4.469
Wilson
Token Wilson
Feature activation+0.000
eyeing
Token eyeing
Feature activation+0.000
a
Token a
Feature activation+0.000
possible
Token possible
Feature activation+0.000
return
Token return
Feature activation+0.000
asking
Token asking
Feature activation+4.411
the
Token the
Feature activation+0.126
right
Token right
Feature activation+0.000
questions
Token questions
Feature activation+0.836
and
Token and
Feature activation+2.698
drawing
Token drawing
Feature activation+4.459
Wilson
Token Wilson
Feature activation+0.000
out
Token out
Feature activation+0.000
on
Token on
Feature activation+2.077
some
Token some
Feature activation+0.000
touch
Token touch
Feature activation+0.000
,
Token,
Feature activation+0.000
192
Token192
Feature activation+0.000
nd
Tokennd
Feature activation+0.000
career
Token career
Feature activation+0.000
hit
Token hit
Feature activation+0.000
,
Token,
Feature activation+4.422
Sky
Token Sky
Feature activation+0.000
line
Tokenline
Feature activation+0.000
's
Token's
Feature activation+0.000
three
Token three
Feature activation+0.000
-
Token-
Feature activation+0.000
did
Token did
Feature activation+0.000
a
Token a
Feature activation+0.000
great
Token great
Feature activation+0.000
job
Token job
Feature activation+0.000
of
Token of
Feature activation+0.000
asking
Token asking
Feature activation+4.411
the
Token the
Feature activation+0.126
right
Token right
Feature activation+0.000
questions
Token questions
Feature activation+0.836
and
Token and
Feature activation+2.698
drawing
Token drawing
Feature activation+4.459
Ċ
TokenĊ
Feature activation+1.849
The
TokenThe
Feature activation+0.000
other
Token other
Feature activation+0.000
issue
Token issue
Feature activation+3.354
here
Token here
Feature activation+0.000
is
Token is
Feature activation+4.379
the
Token the
Feature activation+0.000
employer
Token employer
Feature activation+0.000
didn
Token didn
Feature activation+0.000
't
Token't
Feature activation+0.000
talk
Token talk
Feature activation+2.084
that
Token that
Feature activation+4.131
they
Token they
Feature activation+0.000
didn
Token didn
Feature activation+0.000
't
Token't
Feature activation+0.000
investigate
Token investigate
Feature activation+3.223
whether
Token whether
Feature activation+4.345
Wilson
Token Wilson
Feature activation+0.000
's
Token's
Feature activation+0.000
poor
Token poor
Feature activation+0.000
performance
Token performance
Feature activation+0.000
was
Token was
Feature activation+0.000
Asked
TokenAsked
Feature activation+0.000
Tuesday
Token Tuesday
Feature activation+0.000
if
Token if
Feature activation+3.086
he
Token he
Feature activation+0.000
's
Token's
Feature activation+0.000
concerned
Token concerned
Feature activation+4.265
about
Token about
Feature activation+3.054
the
Token the
Feature activation+0.423
dreaded
Token dreaded
Feature activation+0.010
sophomore
Token sophomore
Feature activation+0.000
slump
Token slump
Feature activation+2.221
Ŀ
TokenĿ
Feature activation+3.921
Wilson
Token Wilson
Feature activation+0.000
said
Token said
Feature activation+0.000
as
Token as
Feature activation+3.427
soon
Token soon
Feature activation+0.704
as
Token as
Feature activation+4.188
he
Token he
Feature activation+0.000
took
Token took
Feature activation+3.974
a
Token a
Feature activation+0.000
seat
Token seat
Feature activation+0.000
between
Token between
Feature activation+4.040
]
Token]
Feature activation+2.183
Ċ
TokenĊ
Feature activation+1.736
Ċ
TokenĊ
Feature activation+2.410
I
TokenI
Feature activation+0.000
double
Token double
Feature activation+0.000
checked
Token checked
Feature activation+4.179
with
Token with
Feature activation+4.001
McC
Token McC
Feature activation+0.000
le
Tokenle
Feature activation+0.000
ll
Tokenll
Feature activation+0.000
en
Tokenen
Feature activation+0.000
s
Tokens
Feature activation+0.009
didn
Token didn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
.)
Token.)
Feature activation+4.141
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+3.694
More
TokenMore
Feature activation+0.852
importantly
Token importantly
Feature activation+0.000
,
Token,
Feature activation+4.011
s
Tokens
Feature activation+0.000
passer
Token passer
Feature activation+0.000
rating
Token rating
Feature activation+0.000
exceeded
Token exceeded
Feature activation+0.406
100
Token 100
Feature activation+0.000
.
Token.
Feature activation+4.135
(
Token (
Feature activation+2.439
K
TokenK
Feature activation+0.000
aepernick
Tokenaepernick
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
brought
Token brought
Feature activation+0.000
forward
Token forward
Feature activation+0.000
.
Token.
Feature activation+3.363
This
Token This
Feature activation+0.000
shows
Token shows
Feature activation+3.690
that
Token that
Feature activation+4.131
they
Token they
Feature activation+0.000
didn
Token didn
Feature activation+0.000
't
Token't
Feature activation+0.000
investigate
Token investigate
Feature activation+3.223
whether
Token whether
Feature activation+4.345
's
Token's
Feature activation+0.000
opponents
Token opponents
Feature activation+0.000
.
Token.
Feature activation+2.445
And
Token And
Feature activation+2.985
in
Token in
Feature activation+0.605
true
Token true
Feature activation+4.115
Ci
Token Ci
Feature activation+0.000
ara
Tokenara
Feature activation+0.000
style
Token style
Feature activation+2.633
,
Token,
Feature activation+3.057
she
Token she
Feature activation+0.000
when
Token when
Feature activation+1.283
you
Token you
Feature activation+0.000
look
Token look
Feature activation+0.000
at
Token at
Feature activation+1.304
it
Token it
Feature activation+0.000
from
Token from
Feature activation+4.113
the
Token the
Feature activation+1.642
49
Token 49
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000

Top DFA by src position
MAX = 5.801

<|endoftext|>
Token<|endoftext|>
Feature activation+0.015
Wilson
Token Wilson
Feature activation+5.801
was
Token was
Feature activation+0.062
working
Token working
Feature activation+0.017
sent
Token sent
Feature activation-0.002
a
Token a
Feature activation+0.019
fax
Token fax
Feature activation+0.021
<|endoftext|>
Token<|endoftext|>
Feature activation+0.011
Wilson
Token Wilson
Feature activation+5.033
was
Token was
Feature activation+0.016
working
Token working
Feature activation+0.013
sent
Token sent
Feature activation-0.003
a
Token a
Feature activation+0.021
fax
Token fax
Feature activation+0.010
P
TokenP
Feature activation+0.162
itch
Tokenitch
Feature activation+0.090
Perfect
Token Perfect
Feature activation+0.077
star
Token star
Feature activation+0.088
Rebel
Token Rebel
Feature activation+0.222
Wilson
Token Wilson
Feature activation+3.278
says
Token says
Feature activation+0.084
she
Token she
Feature activation+0.036
usually
Token usually
Feature activation+0.003
stays
Token stays
Feature activation-0.008
far
Token far
Feature activation-0.001
Brian
Token Brian
Feature activation+0.013
Wilson
Token Wilson
Feature activation+1.640
.
Token.
Feature activation+0.101
Ċ
TokenĊ
Feature activation-0.002
Ċ
TokenĊ
Feature activation+0.039
Wilson
TokenWilson
Feature activation+2.058
skipped
Token skipped
Feature activation+0.028
a
Token a
Feature activation+0.006
tour
Token tour
Feature activation-0.018
with
Token with
Feature activation+0.019
the
Token the
Feature activation+0.018
imm
Tokenimm
Feature activation-0.029
el
Tokenel
Feature activation+0.059
then
Token then
Feature activation-0.003
asked
Token asked
Feature activation-0.014
if
Token if
Feature activation+0.023
Wilson
Token Wilson
Feature activation+1.214
would
Token would
Feature activation+0.008
play
Token play
Feature activation-0.004
both
Token both
Feature activation+0.001
if
Token if
Feature activation+0.018
his
Token his
Feature activation-0.018
<|endoftext|>
Token<|endoftext|>
Feature activation+0.014
Wilson
Token Wilson
Feature activation+5.258
was
Token was
Feature activation+0.095
working
Token working
Feature activation+0.057
sent
Token sent
Feature activation+0.006
a
Token a
Feature activation+0.039
fax
Token fax
Feature activation+0.039
And
TokenAnd
Feature activation+0.071
when
Token when
Feature activation+0.026
the
Token the
Feature activation+0.012
Bears
Token Bears
Feature activation+0.225
placed
Token placed
Feature activation-0.032
Wilson
Token Wilson
Feature activation+3.875
on
Token on
Feature activation-0.000
IR
Token IR
Feature activation+0.017
with
Token with
Feature activation+0.006
designation
Token designation
Feature activation-0.009
to
Token to
Feature activation-0.002
imm
Tokenimm
Feature activation+0.013
el
Tokenel
Feature activation+0.038
then
Token then
Feature activation+0.031
asked
Token asked
Feature activation-0.010
if
Token if
Feature activation+0.031
Wilson
Token Wilson
Feature activation+3.161
would
Token would
Feature activation+0.022
play
Token play
Feature activation-0.009
both
Token both
Feature activation+0.003
if
Token if
Feature activation+0.049
his
Token his
Feature activation+0.000
and
Token and
Feature activation+0.005
around
Token around
Feature activation+0.012
Cincinnati
Token Cincinnati
Feature activation+0.172
,
Token,
Feature activation+0.006
like
Token like
Feature activation+0.089
Wilson
Token Wilson
Feature activation+3.094
,
Token,
Feature activation+0.029
swear
Token swear
Feature activation+0.013
by
Token by
Feature activation+0.105
both
Token both
Feature activation+0.014
.
Token.
Feature activation+0.724
imm
Tokenimm
Feature activation-0.053
el
Tokenel
Feature activation+0.094
then
Token then
Feature activation+0.043
asked
Token asked
Feature activation+0.017
if
Token if
Feature activation+0.059
Wilson
Token Wilson
Feature activation+2.710
would
Token would
Feature activation+0.022
play
Token play
Feature activation-0.013
both
Token both
Feature activation+0.003
if
Token if
Feature activation+0.089
his
Token his
Feature activation-0.024
ied
Tokenied
Feature activation-0.015
the
Token the
Feature activation-0.005
waters
Token waters
Feature activation-0.001
by
Token by
Feature activation+0.029
firing
Token firing
Feature activation-0.023
Wilson
Token Wilson
Feature activation+4.836
when
Token when
Feature activation+0.078
the
Token the
Feature activation+0.024
complaint
Token complaint
Feature activation+0.006
about
Token about
Feature activation+0.089
his
Token his
Feature activation-0.080
ied
Tokenied
Feature activation-0.003
the
Token the
Feature activation-0.006
waters
Token waters
Feature activation+0.001
by
Token by
Feature activation+0.012
firing
Token firing
Feature activation-0.035
Wilson
Token Wilson
Feature activation+2.928
when
Token when
Feature activation+0.065
the
Token the
Feature activation+0.025
complaint
Token complaint
Feature activation+0.003
about
Token about
Feature activation+0.069
his
Token his
Feature activation-0.092
right
Token right
Feature activation+0.005
?
Token?
Feature activation+0.041
Ċ
TokenĊ
Feature activation+0.021
Ċ
TokenĊ
Feature activation-0.019
Russell
TokenRussell
Feature activation+0.711
Wilson
Token Wilson
Feature activation+4.372
insists
Token insists
Feature activation+0.244
it
Token it
Feature activation+0.005
won
Token won
Feature activation+0.001
't
Token't
Feature activation+0.002
be
Token be
Feature activation+0.030
<|endoftext|>
Token<|endoftext|>
Feature activation-0.025
Ŀ
TokenĿ
Feature activation-0.061
talk
Token talk
Feature activation+0.089
,
Token,
Feature activation+0.029
Wilson
Token Wilson
Feature activation+2.794
stepped
Token stepped
Feature activation-0.042
way
Token way
Feature activation+0.015
out
Token out
Feature activation-0.000
of
Token of
Feature activation+0.040
his
Token his
Feature activation+0.023
<|endoftext|>
Token<|endoftext|>
Feature activation+0.002
but
Token but
Feature activation+0.116
before
Token before
Feature activation+0.011
Wilson
Token Wilson
Feature activation+4.478
published
Token published
Feature activation+0.131
his
Token his
Feature activation-0.051
op
Token op
Feature activation-0.064
-
Token-
Feature activation-0.014
ed
Tokened
Feature activation-0.010
.
Token.
Feature activation-0.002
Ċ
TokenĊ
Feature activation+0.081
Ċ
TokenĊ
Feature activation+0.010
As
TokenAs
Feature activation+0.017
to
Token to
Feature activation+0.006
Wilson
Token Wilson
Feature activation+1.418
,
Token,
Feature activation+0.061
he
Token he
Feature activation-0.005
âĢ
TokenâĢ
Feature activation+0.060
Ļ
TokenĻ
Feature activation+0.005
s
Tokens
Feature activation-0.007
.
Token.
Feature activation+0.001
Ċ
TokenĊ
Feature activation+0.061
Ċ
TokenĊ
Feature activation+0.008
As
TokenAs
Feature activation+0.011
to
Token to
Feature activation+0.009
Wilson
Token Wilson
Feature activation+1.905
,
Token,
Feature activation+0.033
he
Token he
Feature activation-0.011
âĢ
TokenâĢ
Feature activation+0.012
Ļ
TokenĻ
Feature activation+0.005
s
Tokens
Feature activation+0.001
ied
Tokenied
Feature activation-0.016
the
Token the
Feature activation-0.010
waters
Token waters
Feature activation+0.001
by
Token by
Feature activation+0.016
firing
Token firing
Feature activation-0.040
Wilson
Token Wilson
Feature activation+2.355
when
Token when
Feature activation+0.038
the
Token the
Feature activation+0.010
complaint
Token complaint
Feature activation+0.003
about
Token about
Feature activation+0.047
his
Token his
Feature activation-0.025
,
Token,
Feature activation+0.016
who
Token who
Feature activation+0.026
is
Token is
Feature activation+0.009
pregnant
Token pregnant
Feature activation+0.002
with
Token with
Feature activation+0.021
Wilson
Token Wilson
Feature activation+4.810
's
Token's
Feature activation+0.003
baby
Token baby
Feature activation+0.002
,
Token,
Feature activation+0.013
was
Token was
Feature activation+0.015
reportedly
Token reportedly
Feature activation+0.007
losing
Token losing
Feature activation-0.006
13
Token 13
Feature activation-0.001
-
Token-
Feature activation-0.004
6
Token6
Feature activation-0.004
.
Token.
Feature activation+0.102
Wilson
Token Wilson
Feature activation+3.720
has
Token has
Feature activation+0.013
been
Token been
Feature activation-0.004
fantastic
Token fantastic
Feature activation+0.002
ever
Token ever
Feature activation-0.005
since
Token since
Feature activation+0.010

Decoder Weights Distribution

Head 0: 0.05

Head 1: 0.08

Head 2: 0.19

Head 3: 0.12

Head 4: 0.05

Head 5: 0.04

Head 6: 0.08

Head 7: 0.05

Head 8: 0.11

Head 9: 0.05

Head 10: 0.11

Head 11: 0.07

Positive logits

Wilson 8.67
Wilson 8.29
Russell 5.97
Seahawks 5.96
Russell 5.39
Lynch 5.02
Seattle 4.81
Carroll 4.55
Sherman 4.45
Seattle 4.39
Cox 4.13
Everett 4.04
Pete 3.99
Baldwin 3.88
Ballard 3.88
Seah 3.85
Burgess 3.75
Rodgers 3.66
Bennett 3.52
Palmer 3.52

Negative logits

Karachi -3.54
Malta -3.44
hepatitis -3.34
Nuclear -3.31
Boko -3.29
lich -3.16
Genie -3.13
uranium -3.11
Sind -3.10
Libyan -3.06
Iranians -3.01
orph -3.01
-2.97
Mumbai -2.97
uclear -2.92
Iranian -2.87
Monteneg -2.84
Nigerian -2.83
Saudi -2.80
Iran -2.78

INTERVAL 5.030 - 5.589
CONTAINS 0.000%

The
Token The
Feature activation+0.000
fax
Token fax
Feature activation+0.000
indicated
Token indicated
Feature activation+4.475
it
Token it
Feature activation+0.000
was
Token was
Feature activation+3.399
suspected
Token suspected
Feature activation+5.589
that
Token that
Feature activation+4.017
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
impaired
Token impaired
Feature activation+0.000
by
Token by
Feature activation+0.000
from
Token from
Feature activation+3.743
the
Token the
Feature activation+0.161
medical
Token medical
Feature activation+0.000
cannabis
Token cannabis
Feature activation+0.000
,
Token,
Feature activation+3.036
terminating
Token terminating
Feature activation+5.563
Wilson
Token Wilson
Feature activation+0.000
's
Token's
Feature activation+0.000
employment
Token employment
Feature activation+0.000
.
Token.
Feature activation+3.574
Ċ
TokenĊ
Feature activation+1.196

INTERVAL 4.471 - 5.030
CONTAINS 0.000%

azing
Tokenazing
Feature activation+0.000
Systems
Token Systems
Feature activation+0.000
.
Token.
Feature activation+1.889
The
Token The
Feature activation+0.000
fax
Token fax
Feature activation+0.000
indicated
Token indicated
Feature activation+4.475
it
Token it
Feature activation+0.000
was
Token was
Feature activation+3.399
suspected
Token suspected
Feature activation+5.589
that
Token that
Feature activation+4.017
he
Token he
Feature activation+0.000
to
Token to
Feature activation+2.098
the
Token the
Feature activation+0.000
record
Token record
Feature activation+0.000
,
Token,
Feature activation+2.072
as
Token as
Feature activation+2.907
did
Token did
Feature activation+4.669
complex
Token complex
Feature activation+0.000
orchestra
Token orchestra
Feature activation+0.000
numbers
Token numbers
Feature activation+0.546
played
Token played
Feature activation+0.000
by
Token by
Feature activation+2.834
paying
Token paying
Feature activation+3.976
players
Token players
Feature activation+0.000
at
Token at
Feature activation+0.000
USC
Token USC
Feature activation+0.000
.
Token.
Feature activation+2.744
But
Token But
Feature activation+4.561
it
Token it
Feature activation+0.000
came
Token came
Feature activation+0.000
out
Token out
Feature activation+0.000
so
Token so
Feature activation+1.532
fast
Token fast
Feature activation+4.539
's
Token's
Feature activation+0.000
example
Token example
Feature activation+0.000
re
Token re
Feature activation+0.000
gun
Token gun
Feature activation+0.000
laws
Token laws
Feature activation+0.000
,"
Token,"
Feature activation+4.762
she
Token she
Feature activation+0.000
tweeted
Token tweeted
Feature activation+0.000
Friday
Token Friday
Feature activation+0.000
.
Token.
Feature activation+1.695
Ċ
TokenĊ
Feature activation+0.000

INTERVAL 3.912 - 4.471
CONTAINS 0.000%

least
Token least
Feature activation+1.835
attempts
Token attempts
Feature activation+0.000
at
Token at
Feature activation+0.000
comedy
Token comedy
Feature activation+0.000
),
Token),
Feature activation+1.948
including
Token including
Feature activation+4.038
a
Token a
Feature activation+0.425
quick
Token quick
Feature activation+0.224
jab
Token jab
Feature activation+0.777
by
Token by
Feature activation+3.422
Wilson
Token Wilson
Feature activation+0.000
when
Token when
Feature activation+1.283
you
Token you
Feature activation+0.000
look
Token look
Feature activation+0.000
at
Token at
Feature activation+1.304
it
Token it
Feature activation+0.000
from
Token from
Feature activation+4.113
the
Token the
Feature activation+1.642
49
Token 49
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
fax
Token fax
Feature activation+0.000
indicated
Token indicated
Feature activation+4.475
it
Token it
Feature activation+0.000
was
Token was
Feature activation+3.399
suspected
Token suspected
Feature activation+5.589
that
Token that
Feature activation+4.017
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
impaired
Token impaired
Feature activation+0.000
by
Token by
Feature activation+0.000
the
Token the
Feature activation+0.000
Ċ
TokenĊ
Feature activation+1.736
Ċ
TokenĊ
Feature activation+2.410
I
TokenI
Feature activation+0.000
double
Token double
Feature activation+0.000
checked
Token checked
Feature activation+4.179
with
Token with
Feature activation+4.001
McC
Token McC
Feature activation+0.000
le
Tokenle
Feature activation+0.000
ll
Tokenll
Feature activation+0.000
en
Tokenen
Feature activation+0.000
to
Token to
Feature activation+0.000
Ċ
TokenĊ
Feature activation+1.849
The
TokenThe
Feature activation+0.000
other
Token other
Feature activation+0.000
issue
Token issue
Feature activation+3.354
here
Token here
Feature activation+0.000
is
Token is
Feature activation+4.379
the
Token the
Feature activation+0.000
employer
Token employer
Feature activation+0.000
didn
Token didn
Feature activation+0.000
't
Token't
Feature activation+0.000
talk
Token talk
Feature activation+2.084

INTERVAL 3.353 - 3.912
CONTAINS 0.001%

worker
Token worker
Feature activation+0.000
at
Token at
Feature activation+0.170
the
Token the
Feature activation+0.000
STA
Token STA
Feature activation+0.000
,
Token,
Feature activation+0.000
said
Token said
Feature activation+3.476
:
Token:
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ĺ
Tokenĺ
Feature activation+3.536
There
TokenThere
Feature activation+0.000
are
Token are
Feature activation+0.000
.
Token.
Feature activation+2.185
Ċ
TokenĊ
Feature activation+1.799
Ċ
TokenĊ
Feature activation+1.849
The
TokenThe
Feature activation+0.000
other
Token other
Feature activation+0.000
issue
Token issue
Feature activation+3.354
here
Token here
Feature activation+0.000
is
Token is
Feature activation+4.379
the
Token the
Feature activation+0.000
employer
Token employer
Feature activation+0.000
didn
Token didn
Feature activation+0.000
many
Token many
Feature activation+0.000
of
Token of
Feature activation+2.264
you
Token you
Feature activation+0.000
are
Token are
Feature activation+0.000
using
Token using
Feature activation+0.000
,
Token,
Feature activation+3.526
the
Token the
Feature activation+0.073
provision
Token provision
Feature activation+0.000
of
Token of
Feature activation+0.753
400
Token 400
Feature activation+0.000
mg
Tokenmg
Feature activation+0.000
(
Token (
Feature activation+0.000
2015
Token2015
Feature activation+0.000
).
Token).
Feature activation+2.739
More
Token More
Feature activation+0.318
recently
Token recently
Feature activation+1.935
,
Token,
Feature activation+3.604
he
Token he
Feature activation+0.000
has
Token has
Feature activation+0.000
played
Token played
Feature activation+0.000
a
Token a
Feature activation+0.000
recurring
Token recurring
Feature activation+0.000
stated
Token stated
Feature activation+3.104
that
Token that
Feature activation+3.198
his
Token his
Feature activation+0.000
was
Token was
Feature activation+1.021
fired
Token fired
Feature activation+0.000
because
Token because
Feature activation+3.454
of
Token of
Feature activation+4.064
his
Token his
Feature activation+0.228
temper
Token temper
Feature activation+0.000
,
Token,
Feature activation+0.491
lack
Token lack
Feature activation+0.000

INTERVAL 2.794 - 3.353
CONTAINS 0.001%

or
Token or
Feature activation+0.000
programming
Token programming
Feature activation+0.000
out
Token out
Feature activation+0.000
of
Token of
Feature activation+1.202
each
Token each
Feature activation+0.000
of
Token of
Feature activation+2.977
them
Token them
Feature activation+0.000
),
Token),
Feature activation+2.450
this
Token this
Feature activation+0.728
one
Token one
Feature activation+0.000
seems
Token seems
Feature activation+0.000
partnerships
Token partnerships
Feature activation+0.000
differ
Token differ
Feature activation+0.000
widely
Token widely
Feature activation+0.000
in
Token in
Feature activation+0.000
value
Token value
Feature activation+0.000
.
Token.
Feature activation+3.210
Buzz
Token Buzz
Feature activation+0.000
feed
Tokenfeed
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
saw
Token saw
Feature activation+3.037
)
Token)
Feature activation+0.000
and
Token and
Feature activation+0.734
then
Token then
Feature activation+0.000
stated
Token stated
Feature activation+3.418
they
Token they
Feature activation+0.000
terminated
Token terminated
Feature activation+2.854
him
Token him
Feature activation+0.000
because
Token because
Feature activation+2.414
of
Token of
Feature activation+2.090
poor
Token poor
Feature activation+0.000
performance
Token performance
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.261
Ċ
TokenĊ
Feature activation+1.793
Wilson
TokenWilson
Feature activation+0.000
then
Token then
Feature activation+0.000
reiterated
Token reiterated
Feature activation+2.621
that
Token that
Feature activation+2.826
he
Token he
Feature activation+0.000
wants
Token wants
Feature activation+2.360
to
Token to
Feature activation+0.000
stay
Token stay
Feature activation+0.000
in
Token in
Feature activation+0.000
increasing
Token increasing
Feature activation+0.000
business
Token business
Feature activation+0.000
demand
Token demand
Feature activation+0.000
for
Token for
Feature activation+0.000
innovation
Token innovation
Feature activation+0.000
,"
Token,"
Feature activation+2.839
said
Token said
Feature activation+2.848
Mr
Token Mr
Feature activation+2.127
.
Token.
Feature activation+1.622
Wilson
Token Wilson
Feature activation+0.000
.
Token.
Feature activation+0.430

INTERVAL 2.235 - 2.794
CONTAINS 0.002%

the
Token the
Feature activation+0.000
current
Token current
Feature activation+0.000
staple
Token staple
Feature activation+0.000
supplements
Token supplements
Feature activation+0.564
many
Token many
Feature activation+0.000
of
Token of
Feature activation+2.264
you
Token you
Feature activation+0.000
are
Token are
Feature activation+0.000
using
Token using
Feature activation+0.000
,
Token,
Feature activation+3.526
the
Token the
Feature activation+0.073
Ed
Token Ed
Feature activation+0.000
Carpenter
Token Carpenter
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
was
Token was
Feature activation+1.853
with
Token with
Feature activation+2.236
Miles
Token Miles
Feature activation+0.000
,
Token,
Feature activation+1.410
representing
Token representing
Feature activation+0.228
all
Token all
Feature activation+0.000
Indy
Token Indy
Feature activation+0.000
took
Token took
Feature activation+0.670
medication
Token medication
Feature activation+0.000
for
Token for
Feature activation+0.000
his
Token his
Feature activation+0.000
disability
Token disability
Feature activation+0.000
.
Token.
Feature activation+2.290
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+2.932
The
TokenThe
Feature activation+0.000
Project
Token Project
Feature activation+0.000
Superintendent
Token Superintendent
Feature activation+0.000
much
Token much
Feature activation+2.751
of
Token of
Feature activation+2.132
that
Token that
Feature activation+0.000
was
Token was
Feature activation+2.798
coaching
Token coaching
Feature activation+1.018
.
Token.
Feature activation+2.486
The
Token The
Feature activation+0.000
answer
Token answer
Feature activation+0.000
is
Token is
Feature activation+0.238
none
Token none
Feature activation+0.000
.
Token.
Feature activation+2.440
drama
Token drama
Feature activation+0.000
ensemble
Token ensemble
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000
series
Token series
Feature activation+0.000
.
Token.
Feature activation+2.535
He
Token He
Feature activation+0.000
also
Token also
Feature activation+0.000
had
Token had
Feature activation+0.000
minor
Token minor
Feature activation+0.000
roles
Token roles
Feature activation+0.000

INTERVAL 1.677 - 2.235
CONTAINS 0.003%

to
Token to
Feature activation+0.000
do
Token do
Feature activation+0.000
the
Token the
Feature activation+0.000
same
Token same
Feature activation+0.000
thing
Token thing
Feature activation+0.481
in
Token in
Feature activation+1.679
2014
Token 2014
Feature activation+0.000
before
Token before
Feature activation+1.890
there
Token there
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
and
Token and
Feature activation+0.000
information
Token information
Feature activation+0.000
on
Token on
Feature activation+1.522
University
Token University
Feature activation+0.000
projects
Token projects
Feature activation+0.000
that
Token that
Feature activation+2.066
were
Token were
Feature activation+0.000
taking
Token taking
Feature activation+0.000
place
Token place
Feature activation+0.000
during
Token during
Feature activation+2.077
his
Token his
Feature activation+0.000
the
Token the
Feature activation+0.000
passing
Token passing
Feature activation+0.000
game
Token game
Feature activation+0.000
.
Token.
Feature activation+2.299
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+2.213
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+2.508
I
TokenI
Feature activation+0.000
can
Token can
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
pretending
Token pretending
Feature activation+0.196
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.000
the
Token the
Feature activation+0.000
flu
Token flu
Feature activation+0.000
.
Token.
Feature activation+2.148
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+2.615
So
TokenSo
Feature activation+2.382
I
Token I
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
was
Token was
Feature activation+0.000
in
Token in
Feature activation+1.334
a
Token a
Feature activation+0.000
plastic
Token plastic
Feature activation+0.000
bubble
Token bubble
Feature activation+0.000
because
Token because
Feature activation+1.721
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
pretending
Token pretending
Feature activation+0.196
to
Token to
Feature activation+0.000
have
Token have
Feature activation+0.000

INTERVAL 1.118 - 1.677
CONTAINS 0.004%

department
Token department
Feature activation+0.000
al
Tokenal
Feature activation+0.000
records
Token records
Feature activation+0.000
and
Token and
Feature activation+0.000
information
Token information
Feature activation+0.000
on
Token on
Feature activation+1.522
University
Token University
Feature activation+0.000
projects
Token projects
Feature activation+0.000
that
Token that
Feature activation+2.066
were
Token were
Feature activation+0.000
taking
Token taking
Feature activation+0.000
at
Token at
Feature activation+0.000
How
Token How
Feature activation+0.000
elsen
Tokenelsen
Feature activation+0.000
.
Token.
Feature activation+1.498
Give
Token Give
Feature activation+2.024
me
Token me
Feature activation+1.149
a
Token a
Feature activation+0.000
90
Token 90
Feature activation+0.000
-
Token-
Feature activation+0.000
meter
Tokenmeter
Feature activation+0.000
Big
Token Big
Feature activation+0.000
in
Token in
Feature activation+0.000
2013
Token 2013
Feature activation+0.000
.
Token.
Feature activation+2.855
Other
Token Other
Feature activation+0.000
film
Token film
Feature activation+0.000
credits
Token credits
Feature activation+1.218
include
Token include
Feature activation+0.000
lead
Token lead
Feature activation+0.000
roles
Token roles
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
Reich
Token Reich
Feature activation+0.000
.
Token.
Feature activation+0.000
According
Token According
Feature activation+0.000
to
Token to
Feature activation+0.000
Wilson
Token Wilson
Feature activation+0.000
,
Token,
Feature activation+1.416
Breitbart
Token Breitbart
Feature activation+0.000
is
Token is
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
acting
Tokenacting
Feature activation+0.000
between
Token between
Feature activation+0.000
Wilson
Token Wilson
Feature activation+0.000
and
Token and
Feature activation+0.459
Rogers
Token Rogers
Feature activation+0.000
,
Token,
Feature activation+0.000
with
Token with
Feature activation+1.307
Hy
Token Hy
Feature activation+0.000
ten
Tokenten
Feature activation+0.000
stuck
Token stuck
Feature activation+0.000
uncom
Token uncom
Feature activation+0.000
fort
Tokenfort
Feature activation+0.000

INTERVAL 0.559 - 1.118
CONTAINS 0.006%

Herbert
Token Herbert
Feature activation+0.000
or
Token or
Feature activation+0.000
Wilson
Token Wilson
Feature activation+0.000
.
Token.
Feature activation+0.760
Ċ
TokenĊ
Feature activation+0.117
Ċ
TokenĊ
Feature activation+1.012
What
TokenWhat
Feature activation+1.069
we
Token we
Feature activation+0.000
hope
Token hope
Feature activation+0.381
to
Token to
Feature activation+0.000
learn
Token learn
Feature activation+0.000
be
Token be
Feature activation+0.000
n
Token n
Feature activation+0.000
imble
Tokenimble
Feature activation+0.000
enough
Token enough
Feature activation+0.000
to
Token to
Feature activation+0.000
avoid
Token avoid
Feature activation+0.874
him
Token him
Feature activation+0.000
.
Token.
Feature activation+0.099
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
like
Token like
Feature activation+0.000
his
Token his
Feature activation+0.000
dad
Token dad
Feature activation+0.000
."
Token."
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.710
Jim
TokenJim
Feature activation+0.000
Johann
Token Johann
Feature activation+0.000
son
Tokenson
Feature activation+0.000
,
Token,
Feature activation+0.000
who
Token who
Feature activation+0.000
glimpse
Token glimpse
Feature activation+0.000
of
Token of
Feature activation+1.245
who
Token who
Feature activation+1.879
's
Token's
Feature activation+0.000
running
Token running
Feature activation+0.072
with
Token with
Feature activation+0.866
the
Token the
Feature activation+0.000
ones
Token ones
Feature activation+0.169
.
Token.
Feature activation+0.072
We
Token We
Feature activation+0.000
may
Token may
Feature activation+0.000
But
TokenBut
Feature activation+0.946
with
Token with
Feature activation+0.000
the
Token the
Feature activation+0.000
plea
Token plea
Feature activation+0.000
deal
Token deal
Feature activation+0.783
,
Token,
Feature activation+0.778
Sim
Token Sim
Feature activation+0.000
cox
Tokencox
Feature activation+0.000
could
Token could
Feature activation+0.000
be
Token be
Feature activation+0.000
out
Token out
Feature activation+0.000

INTERVAL 0.000 - 0.559
CONTAINS 99.984%

its
Token its
Feature activation+0.000
strongest
Token strongest
Feature activation+0.000
,
Token,
Feature activation+0.000
with
Token with
Feature activation+0.000
four
Token four
Feature activation+0.000
box
Token box
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
ranked
Token ranked
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
top
Token top
Feature activation+0.000
.
Token.
Feature activation+0.000
Bar
Token Bar
Feature activation+0.000
ring
Tokenring
Feature activation+0.000
a
Token a
Feature activation+0.000
miracle
Token miracle
Feature activation+0.000
,
Token,
Feature activation+0.000
this
Token this
Feature activation+0.000
April
Token April
Feature activation+0.000
the
Token the
Feature activation+0.000
Buffalo
Token Buffalo
Feature activation+0.000
Sabres
Token Sabres
Feature activation+0.000
.
Token.
Feature activation+0.000
It
Token It
Feature activation+0.000
was
Token was
Feature activation+0.000
either
Token either
Feature activation+0.000
be
Token be
Feature activation+0.000
lost
Token lost
Feature activation+0.000
or
Token or
Feature activation+0.000
be
Token be
Feature activation+0.000
bored
Token bored
Feature activation+0.000
.
Token.
Feature activation+0.000
And
Token And
Feature activation+0.000
of
Token of
Feature activation+0.000
Obama
Token Obama
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
job
Token job
Feature activation+0.000
performance
Token performance
Feature activation+0.000
.
Token.
Feature activation+0.000
That
Token That
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
a
Token a
Feature activation+0.000
male
Token male
Feature activation+0.000
politician
Token politician
Feature activation+0.000
caught
Token caught
Feature activation+0.000
on
Token on
Feature activation+0.000
camera
Token camera
Feature activation+0.000
ch
Token ch
Feature activation+0.000
ort
Tokenort
Feature activation+0.000
ling
Tokenling
Feature activation+0.000
at
Token at
Feature activation+0.000
the
Token the
Feature activation+0.000
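
Note on the INTERVAL / CONTAINS headers above: the dump does not say how these percentages are computed. A minimal sketch, assuming each header reports the share of all sampled token activations that falls inside an equal-width activation bin, could look like the following. The function name `interval_percentages`, the bin edges, and the sample activations are illustrative assumptions, not taken from the dashboard's code.

```python
import numpy as np

def interval_percentages(activations, edges):
    """Return the percentage of activations falling in each [edges[i], edges[i+1]) bin.

    Assumption: the dashboard's "CONTAINS x%" figure is a plain histogram share.
    np.histogram treats the last bin as closed on the right.
    """
    acts = np.asarray(activations, dtype=float)
    counts, _ = np.histogram(acts, bins=edges)
    return 100.0 * counts / acts.size

# Hypothetical sample: mostly zero activations with a few larger values.
sample = np.array([0.0, 0.0, 0.1, 0.6, 1.2])
edges = [0.0, 0.559, 1.118, 1.677]  # bin width 0.559, matching the headers above
percentages = interval_percentages(sample, edges)
for lo, hi, pct in zip(edges[:-1], edges[1:], percentages):
    print(f"INTERVAL {lo:.3f} - {hi:.3f}  CONTAINS {pct:.3f}%")
```

For a feature that is almost always inactive, nearly all of the mass lands in the lowest bin, which is consistent with the 99.984% figure reported for the 0.000 - 0.559 interval.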

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 2: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 2.202

<|endoftext|>
Token<|endoftext|>
Feature activation-6.185
perspective
Token perspective
Feature activation-0.107
.
Token.
Feature activation-0.162
"
Token "
Feature activation+0.223
What
TokenWhat
Feature activation-0.400
is
Token is
Feature activation-0.270
true
Token true
Feature activation-0.943
,
Token,
Feature activation+0.055
though
Token though
Feature activation-0.055
<|endoftext|>
Token<|endoftext|>
Feature activation-5.982
perspective
Token perspective
Feature activation-0.093
.
Token.
Feature activation-0.006
"
Token "
Feature activation+0.440
What
TokenWhat
Feature activation-0.717
is
Token is
Feature activation-0.293
true
Token true
Feature activation-0.495
,
Token,
Feature activation+0.060
though
Token though
Feature activation-0.139
prescriptions
Token prescriptions
Feature activation-0.161
that
Token that
Feature activation-0.143
we
Token we
Feature activation-0.028
've
Token've
Feature activation-0.010
been
Token been
Feature activation-0.099
proposing
Token proposing
Feature activation+0.079
don
Token don
Feature activation-0.761
't
Token't
Feature activation-0.473
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.724
,
Token,
Feature activation+0.063
though
Token though
Feature activation-0.173
,
Token,
Feature activation-0.033
is
Token is
Feature activation-0.007
that
Token that
Feature activation+0.380
whatever
Token whatever
Feature activation-0.415
policy
Token policy
Feature activation-0.495
prescriptions
Token prescriptions
Feature activation-0.199
that
Token that
Feature activation-0.113
we
Token we
Feature activation+0.026
<|endoftext|>
Token<|endoftext|>
Feature activation-8.020
perspective
Token perspective
Feature activation-0.140
.
Token.
Feature activation-0.309
"
Token "
Feature activation+0.806
What
TokenWhat
Feature activation-0.325
is
Token is
Feature activation+0.163
true
Token true
Feature activation-0.105
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.222
perspective
Token perspective
Feature activation-0.161
.
Token.
Feature activation-0.154
"
Token "
Feature activation+1.878
What
TokenWhat
Feature activation-0.307
is
Token is
Feature activation-0.258
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.657
perspective
Token perspective
Feature activation-0.424
.
Token.
Feature activation+0.084
"
Token "
Feature activation+2.202
What
TokenWhat
Feature activation-1.191
is
Token is
Feature activation-0.627
true
Token true
Feature activation-1.003
,
Token,
Feature activation+0.208
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.218
.
Token.
Feature activation-0.123
"
Token "
Feature activation-1.586
What
TokenWhat
Feature activation-0.640
is
Token is
Feature activation-0.348
true
Token true
Feature activation+0.544
,
Token,
Feature activation+0.327
though
Token though
Feature activation-0.645
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.115
is
Token is
Feature activation-0.122
that
Token that
Feature activation-0.545
whatever
Token whatever
Feature activation-0.216
policy
Token policy
Feature activation-0.154
prescriptions
Token prescriptions
Feature activation+1.106
that
Token that
Feature activation-0.520
we
Token we
Feature activation-0.257
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.224
perspective
Token perspective
Feature activation-0.096
.
Token.
Feature activation-0.081
"
Token "
Feature activation+0.524
What
TokenWhat
Feature activation-0.154
is
Token is
Feature activation-0.123
true
Token true
Feature activation-0.366
,
Token,
Feature activation+0.069
though
Token though
Feature activation-0.162
<|endoftext|>
Token<|endoftext|>
Feature activation-7.665
perspective
Token perspective
Feature activation-0.123
.
Token.
Feature activation-0.176
"
Token "
Feature activation+0.356
What
TokenWhat
Feature activation+0.034
is
Token is
Feature activation-0.405
true
Token true
Feature activation-0.422
,
Token,
Feature activation+0.064
though
Token though
Feature activation-0.149
<|endoftext|>
Token<|endoftext|>
Feature activation-6.436
perspective
Token perspective
Feature activation-0.092
.
Token.
Feature activation-0.230
"
Token "
Feature activation+0.425
What
TokenWhat
Feature activation-0.123
is
Token is
Feature activation-0.228
true
Token true
Feature activation-0.633
,
Token,
Feature activation+0.061
though
Token though
Feature activation-0.208
,
Token,
Feature activation-0.085
is
Token is
Feature activation-0.029
that
Token that
Feature activation-0.436
whatever
Token whatever
Feature activation-0.210
policy
Token policy
Feature activation-0.039
prescriptions
Token prescriptions
Feature activation+1.111
that
Token that
Feature activation-0.372
we
Token we
Feature activation-0.056
've
Token've
Feature activation-0.120
been
Token been
Feature activation-0.624
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.086
is
Token is
Feature activation-0.066
that
Token that
Feature activation-0.470
whatever
Token whatever
Feature activation-0.232
policy
Token policy
Feature activation+0.071
prescriptions
Token prescriptions
Feature activation+1.269
that
Token that
Feature activation-0.490
we
Token we
Feature activation-0.106
've
Token've
Feature activation-0.491
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
though
Token though
Feature activation-0.183
,
Token,
Feature activation-0.085
is
Token is
Feature activation-0.188
that
Token that
Feature activation-0.699
whatever
Token whatever
Feature activation-0.414
policy
Token policy
Feature activation+0.359
prescriptions
Token prescriptions
Feature activation+0.303
that
Token that
Feature activation-0.258
we
Token we
Feature activation-0.109
've
Token've
Feature activation+0.035
been
Token been
Feature activation-0.093
whatever
Token whatever
Feature activation-0.212
policy
Token policy
Feature activation-0.099
prescriptions
Token prescriptions
Feature activation-0.282
that
Token that
Feature activation-0.077
we
Token we
Feature activation-0.006
've
Token've
Feature activation+0.087
been
Token been
Feature activation-0.038
proposing
Token proposing
Feature activation-0.064
don
Token don
Feature activation-5.513
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.135
perspective
Token perspective
Feature activation-0.018
.
Token.
Feature activation-0.074
"
Token "
Feature activation+0.639
What
TokenWhat
Feature activation-0.345
is
Token is
Feature activation-0.359
true
Token true
Feature activation-0.676
,
Token,
Feature activation+0.119
though
Token though
Feature activation-0.147
<|endoftext|>
Token<|endoftext|>
Feature activation-7.162
perspective
Token perspective
Feature activation-0.048
.
Token.
Feature activation-0.055
"
Token "
Feature activation+1.144
What
TokenWhat
Feature activation-0.477
is
Token is
Feature activation-0.188
true
Token true
Feature activation-0.480
,
Token,
Feature activation+0.129
though
Token though
Feature activation-0.211
perspective
Token perspective
Feature activation-0.188
.
Token.
Feature activation-0.184
"
Token "
Feature activation+0.528
What
TokenWhat
Feature activation-0.838
is
Token is
Feature activation-0.301
true
Token true
Feature activation+0.813
,
Token,
Feature activation+0.205
though
Token though
Feature activation-0.694
,
Token,
Feature activation-0.102
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.967
perspective
Token perspective
Feature activation-0.076
.
Token.
Feature activation-0.158
"
Token "
Feature activation+0.517
What
TokenWhat
Feature activation-0.445
is
Token is
Feature activation-0.130
true
Token true
Feature activation-0.922
,
Token,
Feature activation+0.235
though
Token though
Feature activation-0.367

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.09

Head 2: 0.07

Head 3: 0.08

Head 4: 0.08

Head 5: 0.09

Head 6: 0.08

Head 7: 0.09

Head 8: 0.10

Head 9: 0.09

Head 10: 0.08

Head 11: 0.08
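
Note on the per-head figures above: the dump does not define them. One plausible reading, offered only as a sketch, is that the feature's decoder vector lives in the concatenated per-head output space, and each figure is that head's share of the decoder weight norm. The function name `head_weight_fractions`, the head dimension of 64, and the random decoder row below are assumptions for illustration.

```python
import numpy as np

def head_weight_fractions(decoder_row, n_heads):
    """Split a decoder vector into n_heads equal slices and return each
    slice's L2 norm as a fraction of the summed norms.

    Assumption: the vector is laid out as [head 0 | head 1 | ... | head n-1].
    Returns all zeros if the decoder vector itself is all zeros.
    """
    w = np.asarray(decoder_row, dtype=float).reshape(n_heads, -1)
    norms = np.linalg.norm(w, axis=1)
    total = norms.sum()
    if total == 0.0:
        return np.zeros(n_heads)
    return norms / total

# Hypothetical decoder row for a 12-head layer with 64 dimensions per head.
rng = np.random.default_rng(0)
row = rng.normal(size=12 * 64)
fractions = head_weight_fractions(row, n_heads=12)
for head, frac in enumerate(fractions):
    print(f"Head {head}: {frac:.2f}")
```

Under this reading, a near-uniform spread such as the 0.07 to 0.10 values listed above would mean that no single head dominates the feature's decoder direction. Because each figure is rounded to two decimals, the printed values need not sum to exactly 1.00.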

Positive logits

[+: 2.94

CLA: 2.89

saint: 2.83

insin: 2.56

accusing: 2.56

criticised: 2.49

hess: 2.43

Rust: 2.43

patron: 2.42

prick: 2.40

her: 2.37

wart: 2.34

claim: 2.34

critic: 2.32

agreeing: 2.31

iberal: 2.28

wcs: 2.28

Rust: 2.26

branding: 2.26

heroine: 2.26

Negative logits

soDeliveryDate: -3.37

endi: -2.96

hower: -2.95

itaire: -2.89

ellen: -2.87

omnia: -2.84

arden: -2.81

dor: -2.78

Gard: -2.75

ando: -2.74

ieri: -2.67

hill: -2.64

paio: -2.59

retri: -2.59

tablets: -2.58

mony: -2.57

endars: -2.54

Pryor: -2.51

Robbins: -2.49

emetery: -2.47

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

Ŀ
TokenĿ
Feature activation+0.000
He
Token He
Feature activation+0.000
called
Token called
Feature activation+0.000
the
Token the
Feature activation+0.000
settlement
Token settlement
Feature activation+0.000
"
Token "
Feature activation+0.000
a
Tokena
Feature activation+0.000
major
Token major
Feature activation+0.000
victory
Token victory
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
EU
Token EU
Feature activation+0.000
still
Token still
Feature activation+0.000
struggles
Token struggles
Feature activation+0.000
to
Token to
Feature activation+0.000
find
Token find
Feature activation+0.000
synerg
Token synerg
Feature activation+0.000
ies
Tokenies
Feature activation+0.000
between
Token between
Feature activation+0.000
its
Token its
Feature activation+0.000
this
Token this
Feature activation+0.000
music
Token music
Feature activation+0.000
designed
Token designed
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
sacred
Token sacred
Feature activation+0.000
someday
Token someday
Feature activation+0.000
?
Token?
Feature activation+0.000
The
Token The
Feature activation+0.000
essence
Token essence
Feature activation+0.000
of
Token of
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
can
Token can
Feature activation+0.000
have
Token have
Feature activation+0.000
the
Token the
Feature activation+0.000
database
Token database
Feature activation+0.000
,
Token,
Feature activation+0.000
rather
Token rather
Feature activation+0.000
than
Token than
Feature activation+0.000
Active
Token Active
Feature activation+0.000
Record
TokenRecord
Feature activation+0.000
that
Token that
Feature activation+0.000
requirement
Token requirement
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Is
TokenIs
Feature activation+0.000
North
Token North
Feature activation+0.000
Korea
Token Korea
Feature activation+0.000
Testing
Token Testing
Feature activation+0.000
a
Token a
Feature activation+0.000
Post
Token Post
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 3: Ultra low frequency cluster

TOP ACTIVATIONS
MAX = 0.585

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.585
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 14.158

the
Token the
Feature activation-0.025
team
Token team
Feature activation+0.000
has
Token has
Feature activation-0.010
signed
Token signed
Feature activation+0.004
forward
Token forward
Feature activation+0.011
Tyson
Token Tyson
Feature activation+14.158
J
Token J
Feature activation-0.166
ost
Tokenost
Feature activation-0.225
(
Token (
Feature activation-0.010
J
TokenJ
Feature activation-0.011
OH
TokenOH
Feature activation-0.073
been
Token been
Feature activation-0.168
proposing
Token proposing
Feature activation-0.015
don
Token don
Feature activation+0.064
't
Token't
Feature activation-0.051
reach
Token reach
Feature activation-0.962
,
Token,
Feature activation+0.329
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation-0.066
we
Token we
Feature activation-0.003
've
Token've
Feature activation+0.106
been
Token been
Feature activation-0.041
proposing
Token proposing
Feature activation-0.017
don
Token don
Feature activation+0.407
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
true
Token true
Feature activation-0.728
,
Token,
Feature activation+0.006
though
Token though
Feature activation-0.191
,
Token,
Feature activation+0.027
is
Token is
Feature activation-0.007
that
Token that
Feature activation+0.077
whatever
Token whatever
Feature activation-0.182
policy
Token policy
Feature activation-0.476
prescriptions
Token prescriptions
Feature activation-0.334
that
Token that
Feature activation-0.043
we
Token we
Feature activation+0.019
<|endoftext|>
Token<|endoftext|>
Feature activation-7.289
perspective
Token perspective
Feature activation-0.203
.
Token.
Feature activation-0.252
"
Token "
Feature activation+0.474
What
TokenWhat
Feature activation+0.380
is
Token is
Feature activation-0.177
true
Token true
Feature activation-0.463
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.540
perspective
Token perspective
Feature activation-0.073
.
Token.
Feature activation-0.133
"
Token "
Feature activation+0.971
What
TokenWhat
Feature activation+0.308
is
Token is
Feature activation-0.288
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.043
perspective
Token perspective
Feature activation-0.281
.
Token.
Feature activation+0.045
"
Token "
Feature activation+0.889
What
TokenWhat
Feature activation-0.389
is
Token is
Feature activation-0.509
true
Token true
Feature activation-1.484
,
Token,
Feature activation+0.071
though
Token though
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.068
that
Token that
Feature activation-0.075
we
Token we
Feature activation-0.114
've
Token've
Feature activation+0.008
been
Token been
Feature activation-0.116
proposing
Token proposing
Feature activation+0.362
don
Token don
Feature activation-0.043
't
Token't
Feature activation-0.394
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
,
Token,
Feature activation-0.013
is
Token is
Feature activation-0.052
that
Token that
Feature activation-0.514
whatever
Token whatever
Feature activation+0.016
policy
Token policy
Feature activation+0.082
prescriptions
Token prescriptions
Feature activation+0.442
that
Token that
Feature activation-0.234
we
Token we
Feature activation-0.185
've
Token've
Feature activation-0.450
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.943
perspective
Token perspective
Feature activation-0.038
.
Token.
Feature activation-0.323
"
Token "
Feature activation-0.029
What
TokenWhat
Feature activation+0.041
is
Token is
Feature activation-0.280
true
Token true
Feature activation-0.606
,
Token,
Feature activation+0.010
though
Token though
Feature activation-0.231
,
Token,
Feature activation-0.090
<|endoftext|>
Token<|endoftext|>
Feature activation-6.585
perspective
Token perspective
Feature activation+0.016
.
Token.
Feature activation-0.155
"
Token "
Feature activation+0.236
What
TokenWhat
Feature activation-0.056
is
Token is
Feature activation-0.375
true
Token true
Feature activation-0.620
,
Token,
Feature activation+0.031
though
Token though
Feature activation-0.173
<|endoftext|>
Token<|endoftext|>
Feature activation-7.072
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation-0.214
"
Token "
Feature activation+0.082
What
TokenWhat
Feature activation+0.172
is
Token is
Feature activation-0.361
true
Token true
Feature activation-0.302
,
Token,
Feature activation+0.026
though
Token though
Feature activation-0.167
,
Token,
Feature activation-0.016
,
Token,
Feature activation-0.003
is
Token is
Feature activation-0.036
that
Token that
Feature activation-0.486
whatever
Token whatever
Feature activation+0.006
policy
Token policy
Feature activation+0.047
prescriptions
Token prescriptions
Feature activation+0.382
that
Token that
Feature activation-0.243
we
Token we
Feature activation-0.312
've
Token've
Feature activation-0.142
been
Token been
Feature activation-0.744
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.507
perspective
Token perspective
Feature activation+0.012
.
Token.
Feature activation-0.044
"
Token "
Feature activation+0.123
What
TokenWhat
Feature activation-0.057
is
Token is
Feature activation-0.185
true
Token true
Feature activation-0.108
,
Token,
Feature activation+0.015
though
Token though
Feature activation-0.074
prescriptions
Token prescriptions
Feature activation+0.235
that
Token that
Feature activation-0.061
we
Token we
Feature activation-0.200
've
Token've
Feature activation-0.031
been
Token been
Feature activation-0.161
proposing
Token proposing
Feature activation+0.578
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.225
perspective
Token perspective
Feature activation-0.535
.
Token.
Feature activation-0.232
"
Token "
Feature activation+0.188
What
TokenWhat
Feature activation-0.030
is
Token is
Feature activation-0.357
true
Token true
Feature activation-0.322
,
Token,
Feature activation+0.122
though
Token though
Feature activation-0.701
<|endoftext|>
Token<|endoftext|>
Feature activation-6.642
perspective
Token perspective
Feature activation-0.027
.
Token.
Feature activation-0.132
"
Token "
Feature activation+0.177
What
TokenWhat
Feature activation+0.036
is
Token is
Feature activation-0.208
true
Token true
Feature activation-0.259
,
Token,
Feature activation+0.040
though
Token though
Feature activation-0.181
<|endoftext|>
Token<|endoftext|>
Feature activation-6.580
perspective
Token perspective
Feature activation-0.059
.
Token.
Feature activation-0.089
"
Token "
Feature activation+0.458
What
TokenWhat
Feature activation-0.174
is
Token is
Feature activation-0.207
true
Token true
Feature activation-0.685
,
Token,
Feature activation+0.047
though
Token though
Feature activation-0.234
.
Token.
Feature activation-0.351
"
Token "
Feature activation-0.864
What
TokenWhat
Feature activation+0.153
is
Token is
Feature activation-0.237
true
Token true
Feature activation-0.360
,
Token,
Feature activation+0.173
though
Token though
Feature activation-0.904
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.315
perspective
Token perspective
Feature activation-0.119
.
Token.
Feature activation-0.184
"
Token "
Feature activation+0.127
What
TokenWhat
Feature activation-0.038
is
Token is
Feature activation-0.207
true
Token true
Feature activation-1.411
,
Token,
Feature activation+0.110
though
Token though
Feature activation-0.419

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.07

Head 2: 0.09

Head 3: 0.10

Head 4: 0.09

Head 5: 0.09

Head 6: 0.07

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.08

Head 11: 0.08

Positive logits

borgh: 3.53

Magikarp: 3.22

Vict: 3.07

Boeing: 3.00

ifest: 2.89

Faw: 2.85

FD: 2.82

arov: 2.80

SpaceX: 2.78

ospace: 2.78

Chelsea: 2.70

Tempest: 2.69

escription: 2.59

Lago: 2.58

Protector: 2.58

saf: 2.58

quished: 2.54

dq: 2.52

abouts: 2.51

phrine: 2.51

Negative logits

abbit: -2.95

ITCH: -2.88

arling: -2.88

Rabbit: -2.84

ritic: -2.71

itability: -2.68

ritis: -2.64

dal: -2.56

etting: -2.56

urally: -2.56

Hebrew: -2.55

english: -2.55

bred: -2.54

Bok: -2.53

Clicker: -2.51

-2.45

grain: -2.39

annot: -2.38

Angus: -2.38

-2.35

INTERVAL 0.526 - 0.585
CONTAINS 0.000%

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.585
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000

INTERVAL 0.468 - 0.526
CONTAINS 0.000%

INTERVAL 0.409 - 0.468
CONTAINS 0.000%

INTERVAL 0.351 - 0.409
CONTAINS 0.000%

INTERVAL 0.292 - 0.351
CONTAINS 0.000%

INTERVAL 0.234 - 0.292
CONTAINS 0.000%

INTERVAL 0.175 - 0.234
CONTAINS 0.000%

INTERVAL 0.117 - 0.175
CONTAINS 0.000%

INTERVAL 0.058 - 0.117
CONTAINS 0.000%

INTERVAL 0.000 - 0.058
CONTAINS 100.000%

job
Token job
Feature activation+0.000
to
Token to
Feature activation+0.000
bolster
Token bolster
Feature activation+0.000
headlines
Token headlines
Feature activation+0.000
.
Token.
Feature activation+0.000
On
Token On
Feature activation+0.000
the
Token the
Feature activation+0.000
other
Token other
Feature activation+0.000
hand
Token hand
Feature activation+0.000
,
Token,
Feature activation+0.000
her
Token her
Feature activation+0.000
7
Token 7
Feature activation+0.000
-
Token-
Feature activation+0.000
19
Token19
Feature activation+0.000
record
Token record
Feature activation+0.000
as
Token as
Feature activation+0.000
a
Token a
Feature activation+0.000
starter
Token starter
Feature activation+0.000
in
Token in
Feature activation+0.000
his
Token his
Feature activation+0.000
first
Token first
Feature activation+0.000
two
Token two
Feature activation+0.000
to
Token to
Feature activation+0.000
traditional
Token traditional
Feature activation+0.000
parenting
Token parenting
Feature activation+0.000
.
Token.
Feature activation+0.000
Of
Token Of
Feature activation+0.000
the
Token the
Feature activation+0.000
research
Token research
Feature activation+0.000
that
Token that
Feature activation+0.000
exists
Token exists
Feature activation+0.000
,
Token,
Feature activation+0.000
on
Token on
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
eastern
Token eastern
Feature activation+0.000
city
Token city
Feature activation+0.000
of
Token of
Feature activation+0.000
Ning
Token Ning
Feature activation+0.000
bo
Tokenbo
Feature activation+0.000
suspended
Token suspended
Feature activation+0.000
a
Token a
Feature activation+0.000
pet
Token pet
Feature activation+0.000
ro
Tokenro
Feature activation+0.000
insured
Tokeninsured
Feature activation+0.000
,
Token,
Feature activation+0.000
despite
Token despite
Feature activation+0.000
paying
Token paying
Feature activation+0.000
the
Token the
Feature activation+0.000
highest
Token highest
Feature activation+0.000
prices
Token prices
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
world
Token world
Feature activation+0.000
for
Token for
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 4: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.442

whatever
Token whatever
Feature activation-0.691
policy
Token policy
Feature activation-0.214
prescriptions
Token prescriptions
Feature activation-0.323
that
Token that
Feature activation-0.429
we
Token we
Feature activation+0.009
've
Token've
Feature activation+0.189
been
Token been
Feature activation-0.271
proposing
Token proposing
Feature activation-0.487
don
Token don
Feature activation-0.208
't
Token't
Feature activation-0.465
reach
Token reach
Feature activation-3.713
<|endoftext|>
Token<|endoftext|>
Feature activation-6.600
perspective
Token perspective
Feature activation-0.050
.
Token.
Feature activation-0.024
"
Token "
Feature activation+0.263
What
TokenWhat
Feature activation-0.756
is
Token is
Feature activation-0.362
true
Token true
Feature activation-0.623
,
Token,
Feature activation-0.067
though
Token though
Feature activation-0.454
prescriptions
Token prescriptions
Feature activation-0.099
that
Token that
Feature activation-0.306
we
Token we
Feature activation-0.010
've
Token've
Feature activation+0.018
been
Token been
Feature activation-0.134
proposing
Token proposing
Feature activation+0.083
don
Token don
Feature activation-1.112
't
Token't
Feature activation-0.607
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.540
policy
Token policy
Feature activation-0.734
prescriptions
Token prescriptions
Feature activation-0.363
that
Token that
Feature activation-0.250
we
Token we
Feature activation+0.073
've
Token've
Feature activation+0.119
been
Token been
Feature activation-0.086
proposing
Token proposing
Feature activation-0.227
don
Token don
Feature activation-0.180
't
Token't
Feature activation-0.207
reach
Token reach
Feature activation-2.328
<|endoftext|>
Token<|endoftext|>
Feature activation-8.935
perspective
Token perspective
Feature activation+0.027
.
Token.
Feature activation-0.300
"
Token "
Feature activation+0.415
What
TokenWhat
Feature activation-0.898
is
Token is
Feature activation-0.704
true
Token true
Feature activation-1.253
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-9.224
perspective
Token perspective
Feature activation-0.086
.
Token.
Feature activation-0.255
"
Token "
Feature activation+0.934
What
TokenWhat
Feature activation-0.809
is
Token is
Feature activation-0.848
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.412
perspective
Token perspective
Feature activation-0.271
.
Token.
Feature activation+0.022
"
Token "
Feature activation+1.442
What
TokenWhat
Feature activation-1.463
is
Token is
Feature activation-0.893
true
Token true
Feature activation-2.076
,
Token,
Feature activation-0.372
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.062
perspective
Token perspective
Feature activation-0.121
.
Token.
Feature activation+0.456
"
Token "
Feature activation+0.817
What
TokenWhat
Feature activation-0.978
is
Token is
Feature activation-0.470
true
Token true
Feature activation-0.571
,
Token,
Feature activation-0.342
though
Token though
Feature activation-2.949
<|endoftext|>
Token<|endoftext|>
Feature activation-9.210
perspective
Token perspective
Feature activation-0.014
.
Token.
Feature activation-0.099
"
Token "
Feature activation+0.108
What
TokenWhat
Feature activation-0.213
is
Token is
Feature activation-0.348
true
Token true
Feature activation-0.230
,
Token,
Feature activation-0.065
though
Token though
Feature activation-0.251
<|endoftext|>
Token<|endoftext|>
Feature activation-8.061
perspective
Token perspective
Feature activation-0.050
.
Token.
Feature activation-0.074
"
Token "
Feature activation+0.250
What
TokenWhat
Feature activation-0.254
is
Token is
Feature activation-0.358
true
Token true
Feature activation-0.464
,
Token,
Feature activation-0.112
though
Token though
Feature activation-0.584
<|endoftext|>
Token<|endoftext|>
Feature activation-8.715
perspective
Token perspective
Feature activation-0.072
.
Token.
Feature activation-0.227
"
Token "
Feature activation+0.112
What
TokenWhat
Feature activation-0.171
is
Token is
Feature activation-0.574
true
Token true
Feature activation-0.592
,
Token,
Feature activation-0.109
though
Token though
Feature activation-0.597
<|endoftext|>
Token<|endoftext|>
Feature activation-7.121
perspective
Token perspective
Feature activation-0.067
.
Token.
Feature activation-0.270
"
Token "
Feature activation+0.168
What
TokenWhat
Feature activation-0.336
is
Token is
Feature activation-0.469
true
Token true
Feature activation-0.713
,
Token,
Feature activation-0.118
though
Token though
Feature activation-0.806
,
Token,
Feature activation-0.137
is
Token is
Feature activation-0.161
that
Token that
Feature activation-0.713
whatever
Token whatever
Feature activation-0.443
policy
Token policy
Feature activation-0.455
prescriptions
Token prescriptions
Feature activation+0.097
that
Token that
Feature activation-0.588
we
Token we
Feature activation-0.110
've
Token've
Feature activation-0.031
been
Token been
Feature activation-0.880
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.147
is
Token is
Feature activation-0.215
that
Token that
Feature activation-0.872
whatever
Token whatever
Feature activation-0.478
policy
Token policy
Feature activation-0.398
prescriptions
Token prescriptions
Feature activation+0.191
that
Token that
Feature activation-0.718
we
Token we
Feature activation+0.013
've
Token've
Feature activation-0.417
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.049
that
Token that
Feature activation-0.539
we
Token we
Feature activation-0.094
've
Token've
Feature activation+0.061
been
Token been
Feature activation-0.165
proposing
Token proposing
Feature activation+0.226
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.290
policy
Token policy
Feature activation-0.093
prescriptions
Token prescriptions
Feature activation-0.317
that
Token that
Feature activation-0.098
we
Token we
Feature activation+0.002
've
Token've
Feature activation+0.116
been
Token been
Feature activation-0.045
proposing
Token proposing
Feature activation-0.049
don
Token don
Feature activation-5.748
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.021
perspective
Token perspective
Feature activation+0.049
.
Token.
Feature activation-0.063
"
Token "
Feature activation+0.278
What
TokenWhat
Feature activation-0.470
is
Token is
Feature activation-0.588
true
Token true
Feature activation-1.041
,
Token,
Feature activation-0.193
though
Token though
Feature activation-0.564
<|endoftext|>
Token<|endoftext|>
Feature activation-7.996
perspective
Token perspective
Feature activation+0.064
.
Token.
Feature activation-0.082
"
Token "
Feature activation+0.684
What
TokenWhat
Feature activation-0.561
is
Token is
Feature activation-0.419
true
Token true
Feature activation-0.815
,
Token,
Feature activation-0.293
though
Token though
Feature activation-0.836
<|endoftext|>
Token<|endoftext|>
Feature activation-7.582
perspective
Token perspective
Feature activation+0.084
.
Token.
Feature activation-0.161
"
Token "
Feature activation+0.377
What
TokenWhat
Feature activation-1.259
is
Token is
Feature activation-0.605
true
Token true
Feature activation-0.889
,
Token,
Feature activation-0.235
though
Token though
Feature activation-2.132
<|endoftext|>
Token<|endoftext|>
Feature activation-7.700
perspective
Token perspective
Feature activation+0.029
.
Token.
Feature activation+0.147
"
Token "
Feature activation+0.714
What
TokenWhat
Feature activation-0.680
is
Token is
Feature activation-0.460
true
Token true
Feature activation-1.735
,
Token,
Feature activation-0.294
though
Token though
Feature activation-1.403

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.07

Head 2: 0.09

Head 3: 0.08

Head 4: 0.09

Head 5: 0.08

Head 6: 0.08

Head 7: 0.09

Head 8: 0.08

Head 9: 0.08

Head 10: 0.08

Head 11: 0.09

Positive logits

NES: 3.42

Condition: 2.71

HOU: 2.63

MIN: 2.57

NETWORK: 2.55

Russ: 2.51

Kun: 2.50

haul: 2.45

FUN: 2.44

arrang: 2.40

Jenn: 2.40

ISION: 2.37

PN: 2.36

GGGG: 2.36

NES: 2.29

EGIN: 2.27

GS: 2.26

laun: 2.25

Ending: 2.25

======: 2.24

Negative logits

Strait: -3.30

Rider: -3.06

Rect: -3.00

Ruler: -2.98

tle: -2.94

Diaz: -2.69

Darth: -2.68

Swordsman: -2.66

imble: -2.60

caster: -2.55

Skywalker: -2.54

Kenobi: -2.54

Obi: -2.50

Vader: -2.50

igon: -2.48

fts: -2.43

qt: -2.42

troop: -2.42

heir: -2.41

Butler: -2.39

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

construction
Token construction
Feature activation+0.000
workers
Token workers
Feature activation+0.000
from
Token from
Feature activation+0.000
Los
Token Los
Feature activation+0.000
Angeles
Token Angeles
Feature activation+0.000
who
Token who
Feature activation+0.000
specialize
Token specialize
Feature activation+0.000
in
Token in
Feature activation+0.000
set
Token set
Feature activation+0.000
design
Token design
Feature activation+0.000
.
Token.
Feature activation+0.000
ates
Tokenates
Feature activation+0.000
reportedly
Token reportedly
Feature activation+0.000
intervened
Token intervened
Feature activation+0.000
,
Token,
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
according
Token according
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
L
Token L
Feature activation+0.000
.
Token.
Feature activation+0.000
presentation
Token presentation
Feature activation+0.000
or
Token or
Feature activation+0.000
content
Token content
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
argument
Token argument
Feature activation+0.000
has
Token has
Feature activation+0.000
covered
Token covered
Feature activation+0.000
everything
Token everything
Feature activation+0.000
from
Token from
Feature activation+0.000
R
Token R
Feature activation+0.000
used
Token used
Feature activation+0.000
Twitter
Token Twitter
Feature activation+0.000
to
Token to
Feature activation+0.000
help
Token help
Feature activation+0.000
escape
Token escape
Feature activation+0.000
from
Token from
Feature activation+0.000
a
Token a
Feature activation+0.000
locked
Token locked
Feature activation+0.000
bathroom
Token bathroom
Feature activation+0.000
stall
Token stall
Feature activation+0.000
on
Token on
Feature activation+0.000
U
Token U
Feature activation+0.000
mar
Tokenmar
Feature activation+0.000
ov
Tokenov
Feature activation+0.000
warned
Token warned
Feature activation+0.000
in
Token in
Feature activation+0.000
an
Token an
Feature activation+0.000
interview
Token interview
Feature activation+0.000
on
Token on
Feature activation+0.000
a
Token a
Feature activation+0.000
rebel
Token rebel
Feature activation+0.000
-
Token-
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 5: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.510

<|endoftext|>
Token<|endoftext|>
Feature activation-4.429
perspective
Token perspective
Feature activation-0.032
.
Token.
Feature activation-0.085
"
Token "
Feature activation+0.151
What
TokenWhat
Feature activation-0.158
is
Token is
Feature activation+0.001
true
Token true
Feature activation-0.661
,
Token,
Feature activation+0.102
though
Token though
Feature activation-0.031
<|endoftext|>
Token<|endoftext|>
Feature activation-4.222
perspective
Token perspective
Feature activation-0.002
.
Token.
Feature activation+0.011
"
Token "
Feature activation+0.306
What
TokenWhat
Feature activation-0.302
is
Token is
Feature activation-0.031
true
Token true
Feature activation-0.287
,
Token,
Feature activation+0.154
though
Token though
Feature activation-0.085
prescriptions
Token prescriptions
Feature activation+0.060
that
Token that
Feature activation-0.012
we
Token we
Feature activation+0.039
've
Token've
Feature activation+0.014
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.427
don
Token don
Feature activation-0.553
't
Token't
Feature activation-0.145
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.410
,
Token,
Feature activation+0.084
though
Token though
Feature activation-0.154
,
Token,
Feature activation-0.063
is
Token is
Feature activation+0.125
that
Token that
Feature activation+0.432
whatever
Token whatever
Feature activation-0.132
policy
Token policy
Feature activation-0.254
prescriptions
Token prescriptions
Feature activation-0.189
that
Token that
Feature activation-0.027
we
Token we
Feature activation+0.047
<|endoftext|>
Token<|endoftext|>
Feature activation-5.811
perspective
Token perspective
Feature activation-0.000
.
Token.
Feature activation-0.191
"
Token "
Feature activation+0.500
What
TokenWhat
Feature activation+0.271
is
Token is
Feature activation+0.868
true
Token true
Feature activation-0.107
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.979
perspective
Token perspective
Feature activation+0.051
.
Token.
Feature activation+0.011
"
Token "
Feature activation+1.265
What
TokenWhat
Feature activation+0.261
is
Token is
Feature activation+0.503
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.695
perspective
Token perspective
Feature activation-0.128
.
Token.
Feature activation+0.107
"
Token "
Feature activation+1.510
What
TokenWhat
Feature activation-0.268
is
Token is
Feature activation+0.112
true
Token true
Feature activation-0.794
,
Token,
Feature activation+0.244
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.093
.
Token.
Feature activation-0.325
"
Token "
Feature activation-1.081
What
TokenWhat
Feature activation+0.010
is
Token is
Feature activation+0.164
true
Token true
Feature activation+0.617
,
Token,
Feature activation+0.138
though
Token though
Feature activation-0.519
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.025
is
Token is
Feature activation+0.084
that
Token that
Feature activation-0.046
whatever
Token whatever
Feature activation-0.087
policy
Token policy
Feature activation-0.096
prescriptions
Token prescriptions
Feature activation+0.766
that
Token that
Feature activation-0.258
we
Token we
Feature activation-0.030
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.139
perspective
Token perspective
Feature activation-0.018
.
Token.
Feature activation-0.036
"
Token "
Feature activation+0.337
What
TokenWhat
Feature activation+0.037
is
Token is
Feature activation+0.189
true
Token true
Feature activation-0.139
,
Token,
Feature activation+0.171
though
Token though
Feature activation-0.139
true
Token true
Feature activation-0.179
,
Token,
Feature activation+0.160
though
Token though
Feature activation-0.130
,
Token,
Feature activation-0.077
is
Token is
Feature activation+0.118
that
Token that
Feature activation+0.265
whatever
Token whatever
Feature activation+0.085
policy
Token policy
Feature activation+0.202
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
true
Token true
Feature activation-0.349
,
Token,
Feature activation+0.164
though
Token though
Feature activation-0.194
,
Token,
Feature activation-0.141
is
Token is
Feature activation+0.133
that
Token that
Feature activation+0.459
whatever
Token whatever
Feature activation-0.094
policy
Token policy
Feature activation+0.214
prescriptions
Token prescriptions
Feature activation-0.006
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
,
Token,
Feature activation-0.041
is
Token is
Feature activation+0.116
that
Token that
Feature activation-0.069
whatever
Token whatever
Feature activation-0.098
policy
Token policy
Feature activation-0.066
prescriptions
Token prescriptions
Feature activation+1.119
that
Token that
Feature activation-0.146
we
Token we
Feature activation-0.004
've
Token've
Feature activation-0.039
been
Token been
Feature activation-0.381
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.063
is
Token is
Feature activation+0.079
that
Token that
Feature activation-0.053
whatever
Token whatever
Feature activation-0.099
policy
Token policy
Feature activation+0.008
prescriptions
Token prescriptions
Feature activation+1.157
that
Token that
Feature activation-0.213
we
Token we
Feature activation+0.109
've
Token've
Feature activation-0.260
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.396
that
Token that
Feature activation-0.132
we
Token we
Feature activation+0.035
've
Token've
Feature activation+0.048
been
Token been
Feature activation-0.018
proposing
Token proposing
Feature activation+0.710
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.097
policy
Token policy
Feature activation-0.037
prescriptions
Token prescriptions
Feature activation-0.201
that
Token that
Feature activation-0.042
we
Token we
Feature activation+0.005
've
Token've
Feature activation+0.067
been
Token been
Feature activation-0.015
proposing
Token proposing
Feature activation-0.024
don
Token don
Feature activation-4.804
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.115
perspective
Token perspective
Feature activation+0.089
.
Token.
Feature activation-0.014
"
Token "
Feature activation+0.409
What
TokenWhat
Feature activation-0.025
is
Token is
Feature activation+0.177
true
Token true
Feature activation-0.341
,
Token,
Feature activation+0.202
though
Token though
Feature activation-0.104
<|endoftext|>
Token<|endoftext|>
Feature activation-5.124
perspective
Token perspective
Feature activation+0.061
.
Token.
Feature activation-0.017
"
Token "
Feature activation+0.748
What
TokenWhat
Feature activation-0.146
is
Token is
Feature activation+0.227
true
Token true
Feature activation-0.224
,
Token,
Feature activation+0.184
though
Token though
Feature activation-0.172
perspective
Token perspective
Feature activation-0.157
.
Token.
Feature activation-0.176
"
Token "
Feature activation+0.334
What
TokenWhat
Feature activation-0.095
is
Token is
Feature activation+0.214
true
Token true
Feature activation+0.523
,
Token,
Feature activation+0.261
though
Token though
Feature activation-0.509
,
Token,
Feature activation-0.490
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.006
perspective
Token perspective
Feature activation+0.021
.
Token.
Feature activation-0.177
"
Token "
Feature activation+0.338
What
TokenWhat
Feature activation+0.019
is
Token is
Feature activation+0.389
true
Token true
Feature activation-0.649
,
Token,
Feature activation+0.264
though
Token though
Feature activation-0.293
,
Token,
Feature activation-0.216
is
Token is
Feature activation+0.164

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.09

Head 2: 0.08

Head 3: 0.07

Head 4: 0.08

Head 5: 0.08

Head 6: 0.08

Head 7: 0.08

Head 8: 0.09

Head 9: 0.07

Head 10: 0.09

Head 11: 0.10

Positive logits

aucas 3.32

Afric 3.15

ugu 2.86

cous 2.73

incial 2.69

Op 2.67

aft 2.61

om 2.61

Tas 2.54

intellig 2.52

telesc 2.51

guided 2.50

Alger 2.50

targ 2.48

finder 2.46

miscar 2.44

xual 2.43

abduction 2.42

iru 2.41

enh 2.39

Negative logits

Marino -2.95

cffffcc -2.65

Schiff -2.60

Mellon -2.56

coon -2.55

antha -2.53

emonium -2.50

Meow -2.49

vents -2.47

Dough -2.46

currency -2.45

forum -2.42

razil -2.42

Bowser -2.39

ATTLE -2.39

fumes -2.37

Schumer -2.35

eteria -2.35

Pace -2.35

omination -2.33

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

amount
Token amount
Feature activation+0.000
of
Token of
Feature activation+0.000
time
Token time
Feature activation+0.000
at
Token at
Feature activation+0.000
the
Token the
Feature activation+0.000
beginning
Token beginning
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
movie
Token movie
Feature activation+0.000
showing
Token showing
Feature activation+0.000
that
Token that
Feature activation+0.000
conferences
Token conferences
Feature activation+0.000
and
Token and
Feature activation+0.000
conventions
Token conventions
Feature activation+0.000
around
Token around
Feature activation+0.000
the
Token the
Feature activation+0.000
country
Token country
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
paid
Token paid
Feature activation+0.000
her
Token her
Feature activation+0.000
about
Token about
Feature activation+0.000
has
Token has
Feature activation+0.000
been
Token been
Feature activation+0.000
overshadowed
Token overshadowed
Feature activation+0.000
by
Token by
Feature activation+0.000
her
Token her
Feature activation+0.000
own
Token own
Feature activation+0.000
daughter
Token daughter
Feature activation+0.000
,
Token,
Feature activation+0.000
Bristol
Token Bristol
Feature activation+0.000
.
Token.
Feature activation+0.000
Last
Token Last
Feature activation+0.000
to
Token to
Feature activation+0.000
play
Token play
Feature activation+0.000
against
Token against
Feature activation+0.000
.
Token.
Feature activation+0.000
I
Token I
Feature activation+0.000
don
Token don
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
think
Token think
Feature activation+0.000
they
Token they
Feature activation+0.000
might
Token might
Feature activation+0.000
raise
Token raise
Feature activation+0.000
the
Token the
Feature activation+0.000
rates
Token rates
Feature activation+0.000
.
Token.
Feature activation+0.000
With
Token With
Feature activation+0.000
the
Token the
Feature activation+0.000
mold
Token mold
Feature activation+0.000
,
Token,
Feature activation+0.000
this
Token this
Feature activation+0.000
has
Token has
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 6: Ultra low frequency cluster

TOP ACTIVATIONS
MAX = 0.068

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.068
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 16.964

the
Token the
Feature activation-0.043
team
Token team
Feature activation-0.016
has
Token has
Feature activation-0.014
signed
Token signed
Feature activation-0.023
forward
Token forward
Feature activation+0.004
Tyson
Token Tyson
Feature activation+16.964
J
Token J
Feature activation-0.118
ost
Tokenost
Feature activation-0.257
(
Token (
Feature activation+0.007
J
TokenJ
Feature activation-0.058
OH
TokenOH
Feature activation-0.082
<|endoftext|>
Token<|endoftext|>
Feature activation-5.999
perspective
Token perspective
Feature activation-0.047
.
Token.
Feature activation-0.083
"
Token "
Feature activation+0.297
What
TokenWhat
Feature activation-0.608
is
Token is
Feature activation-0.299
true
Token true
Feature activation-0.542
,
Token,
Feature activation-0.044
though
Token though
Feature activation-0.272
whatever
Token whatever
Feature activation-0.238
policy
Token policy
Feature activation-0.072
prescriptions
Token prescriptions
Feature activation-0.292
that
Token that
Feature activation-0.068
we
Token we
Feature activation-0.003
've
Token've
Feature activation+0.114
been
Token been
Feature activation-0.042
proposing
Token proposing
Feature activation-0.021
don
Token don
Feature activation-5.855
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.751
,
Token,
Feature activation-0.049
though
Token though
Feature activation-0.423
,
Token,
Feature activation-0.051
is
Token is
Feature activation-0.094
that
Token that
Feature activation+0.298
whatever
Token whatever
Feature activation-0.321
policy
Token policy
Feature activation-0.501
prescriptions
Token prescriptions
Feature activation-0.328
that
Token that
Feature activation+0.048
we
Token we
Feature activation+0.037
<|endoftext|>
Token<|endoftext|>
Feature activation-8.211
perspective
Token perspective
Feature activation-0.162
.
Token.
Feature activation-0.447
"
Token "
Feature activation+0.570
What
TokenWhat
Feature activation-0.100
is
Token is
Feature activation-0.121
true
Token true
Feature activation-0.858
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.456
perspective
Token perspective
Feature activation-0.154
.
Token.
Feature activation-0.330
"
Token "
Feature activation+1.500
What
TokenWhat
Feature activation-0.107
is
Token is
Feature activation-0.395
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.719
perspective
Token perspective
Feature activation-0.367
.
Token.
Feature activation-0.118
"
Token "
Feature activation+1.778
What
TokenWhat
Feature activation-0.871
is
Token is
Feature activation-0.606
true
Token true
Feature activation-1.479
,
Token,
Feature activation-0.078
though
Token though
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.071
that
Token that
Feature activation+0.001
we
Token we
Feature activation-0.140
've
Token've
Feature activation+0.053
been
Token been
Feature activation-0.090
proposing
Token proposing
Feature activation+0.446
don
Token don
Feature activation-0.997
't
Token't
Feature activation-0.149
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
,
Token,
Feature activation-0.102
is
Token is
Feature activation-0.133
that
Token that
Feature activation-0.639
whatever
Token whatever
Feature activation-0.247
policy
Token policy
Feature activation+0.024
prescriptions
Token prescriptions
Feature activation+0.647
that
Token that
Feature activation-0.248
we
Token we
Feature activation-0.316
've
Token've
Feature activation-0.348
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.529
perspective
Token perspective
Feature activation-0.066
.
Token.
Feature activation-0.354
"
Token "
Feature activation+0.239
What
TokenWhat
Feature activation-0.028
is
Token is
Feature activation-0.362
true
Token true
Feature activation-0.688
,
Token,
Feature activation-0.045
though
Token though
Feature activation-0.489
<|endoftext|>
Token<|endoftext|>
Feature activation-7.292
perspective
Token perspective
Feature activation+0.006
.
Token.
Feature activation-0.264
"
Token "
Feature activation+0.449
What
TokenWhat
Feature activation-0.235
is
Token is
Feature activation-0.461
true
Token true
Feature activation-0.892
,
Token,
Feature activation-0.050
though
Token though
Feature activation-0.323
<|endoftext|>
Token<|endoftext|>
Feature activation-7.838
perspective
Token perspective
Feature activation-0.079
.
Token.
Feature activation-0.254
"
Token "
Feature activation+0.194
What
TokenWhat
Feature activation+0.127
is
Token is
Feature activation-0.481
true
Token true
Feature activation-0.548
,
Token,
Feature activation-0.055
though
Token though
Feature activation-0.351
,
Token,
Feature activation-0.105
is
Token is
Feature activation-0.101
that
Token that
Feature activation-0.587
whatever
Token whatever
Feature activation-0.238
policy
Token policy
Feature activation-0.102
prescriptions
Token prescriptions
Feature activation+0.549
that
Token that
Feature activation-0.215
we
Token we
Feature activation-0.337
've
Token've
Feature activation+0.034
been
Token been
Feature activation-0.667
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.370
perspective
Token perspective
Feature activation-0.021
.
Token.
Feature activation-0.110
"
Token "
Feature activation+0.276
What
TokenWhat
Feature activation-0.132
is
Token is
Feature activation-0.268
true
Token true
Feature activation-0.219
,
Token,
Feature activation-0.047
though
Token though
Feature activation-0.149
prescriptions
Token prescriptions
Feature activation+0.349
that
Token that
Feature activation-0.037
we
Token we
Feature activation-0.203
've
Token've
Feature activation+0.056
been
Token been
Feature activation-0.127
proposing
Token proposing
Feature activation+0.827
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.946
perspective
Token perspective
Feature activation-0.304
.
Token.
Feature activation-0.365
"
Token "
Feature activation+0.310
What
TokenWhat
Feature activation-0.563
is
Token is
Feature activation-0.299
true
Token true
Feature activation-0.491
,
Token,
Feature activation-0.017
though
Token though
Feature activation-1.265
<|endoftext|>
Token<|endoftext|>
Feature activation-7.347
perspective
Token perspective
Feature activation-0.062
.
Token.
Feature activation-0.214
"
Token "
Feature activation+0.347
What
TokenWhat
Feature activation-0.084
is
Token is
Feature activation-0.248
true
Token true
Feature activation-0.451
,
Token,
Feature activation-0.052
though
Token though
Feature activation-0.357
<|endoftext|>
Token<|endoftext|>
Feature activation-7.311
perspective
Token perspective
Feature activation-0.025
.
Token.
Feature activation-0.206
"
Token "
Feature activation+0.880
What
TokenWhat
Feature activation-0.386
is
Token is
Feature activation-0.280
true
Token true
Feature activation-0.601
,
Token,
Feature activation-0.080
though
Token though
Feature activation-0.489
.
Token.
Feature activation-0.606
"
Token "
Feature activation-2.284
What
TokenWhat
Feature activation-0.366
is
Token is
Feature activation-0.254
true
Token true
Feature activation-0.382
,
Token,
Feature activation+0.053
though
Token though
Feature activation-1.728
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.064
perspective
Token perspective
Feature activation-0.119
.
Token.
Feature activation-0.453
"
Token "
Feature activation+0.160
What
TokenWhat
Feature activation-0.278
is
Token is
Feature activation-0.161
true
Token true
Feature activation-1.342
,
Token,
Feature activation-0.037
though
Token though
Feature activation-0.835

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.07

Head 2: 0.08

Head 3: 0.10

Head 4: 0.08

Head 5: 0.08

Head 6: 0.08

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.09

Head 11: 0.08

Positive logits

bloom 2.80

scroll 2.71

HUD 2.48

locks 2.45

aret 2.43

hack 2.40

gall 2.40

atts 2.38

orn 2.37

inspector 2.37

jew 2.37

tip 2.34

scrolls 2.34

busted 2.31

iddler 2.25

stole 2.25

auditor 2.25

penetrated 2.24

wake 2.24

jew 2.23

Negative logits

distribut -2.94

ç -2.92

Dum -2.89

æ -2.87

icio -2.87

ciating -2.86

Proud -2.85

-2.83

DeL -2.67

Abs -2.67

Eps -2.63

-2.61

hyde -2.59

Beau -2.56

oute -2.55

ヘラ -2.53

atus -2.51

Ther -2.50

Cla -2.48

Ale -2.47

INTERVAL 0.061 - 0.068
CONTAINS 0.000%

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.068
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000

INTERVAL 0.055 - 0.061
CONTAINS 0.000%

INTERVAL 0.048 - 0.055
CONTAINS 0.000%

INTERVAL 0.041 - 0.048
CONTAINS 0.000%

INTERVAL 0.034 - 0.041
CONTAINS 0.000%

INTERVAL 0.027 - 0.034
CONTAINS 0.000%

INTERVAL 0.020 - 0.027
CONTAINS 0.000%

INTERVAL 0.014 - 0.020
CONTAINS 0.000%

INTERVAL 0.007 - 0.014
CONTAINS 0.000%

INTERVAL 0.000 - 0.007
CONTAINS 100.000%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
,
Token,
Feature activation+0.000
then
Token then
Feature activation+0.000
give
Token give
Feature activation+0.000
the
Token the
Feature activation+0.000
model
Token model
Feature activation+0.000
the
Token the
Feature activation+0.000
reference
Token reference
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
view
Token view
Feature activation+0.000
Post
Token Post
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
Cleveland
Token Cleveland
Feature activation+0.000
looks
Token looks
Feature activation+0.000
like
Token like
Feature activation+0.000
as
Token as
Feature activation+0.000
it
Token it
Feature activation+0.000
prepares
Token prepares
Feature activation+0.000
for
Token for
Feature activation+0.000
payment
Token payment
Feature activation+0.000
issues
Token issues
Feature activation+0.000
since
Token since
Feature activation+0.000
August
Token August
Feature activation+0.000
4
Token 4
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Top
TokenTop
Feature activation+0.000
hospital
Token hospital
Feature activation+0.000
doctors
Token doctors
Feature activation+0.000
it
Token it
Feature activation+0.000
simply
Token simply
Feature activation+0.000
downloads
Token downloads
Feature activation+0.000
a
Token a
Feature activation+0.000
malicious
Token malicious
Feature activation+0.000
script
Token script
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
nasty
Token nasty
Feature activation+0.000
as
Token as
Feature activation+0.000
it
Token it
Feature activation+0.000
ran
Token ran
Feature activation+0.000
an
Token an
Feature activation+0.000
exclusive
Token exclusive
Feature activation+0.000
interview
Token interview
Feature activation+0.000
with
Token with
Feature activation+0.000
her
Token her
Feature activation+0.000
back
Token back
Feature activation+0.000
in
Token in
Feature activation+0.000
December
Token December
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 7: In texts about Israeli–Palestinian conflict

TOP ACTIVATIONS
MAX = 6.609

built
Token built
Feature activation+1.400
on
Token on
Feature activation+2.945
occupied
Token occupied
Feature activation+4.053
Palestinian
Token Palestinian
Feature activation+4.452
and
Token and
Feature activation+3.665
Syrian
Token Syrian
Feature activation+6.273
land
Token land
Feature activation+1.739
in
Token in
Feature activation+3.486
violation
Token violation
Feature activation+1.277
of
Token of
Feature activation+2.519
international
Token international
Feature activation+2.857
,
Token,
Feature activation+1.808
which
Token which
Feature activation+2.355
Israel
Token Israel
Feature activation+3.803
took
Token took
Feature activation+3.371
in
Token in
Feature activation+4.552
the
Token the
Feature activation+5.867
1967
Token 1967
Feature activation+3.196
war
Token war
Feature activation+4.683
and
Token and
Feature activation+4.550
later
Token later
Feature activation+6.609
annexed
Token annexed
Feature activation+2.402
Israel
Token Israel
Feature activation+2.869
halted
Token halted
Feature activation+4.607
settlement
Token settlement
Feature activation+3.360
building
Token building
Feature activation+1.922
in
Token in
Feature activation+4.380
the
Token the
Feature activation+5.707
West
Token West
Feature activation+0.338
Bank
Token Bank
Feature activation+3.399
and
Token and
Feature activation+5.071
East
Token East
Feature activation+1.958
Jerusalem
Token Jerusalem
Feature activation+3.424
and
Token and
Feature activation+4.444
a
Token a
Feature activation+3.318
limited
Token limited
Feature activation+2.976
one
Token one
Feature activation+2.647
into
Token into
Feature activation+5.084
the
Token the
Feature activation+5.689
Gaza
Token Gaza
Feature activation+3.249
Strip
Token Strip
Feature activation+2.414
,
Token,
Feature activation+2.938
during
Token during
Feature activation+2.374
which
Token which
Feature activation+2.612
hold
Token hold
Feature activation+2.273
talks
Token talks
Feature activation+0.994
to
Token to
Feature activation+1.931
calm
Token calm
Feature activation+2.874
the
Token the
Feature activation+5.025
recent
Token recent
Feature activation+5.518
surge
Token surge
Feature activation+1.169
of
Token of
Feature activation+4.745
violence
Token violence
Feature activation+1.235
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
in
Token in
Feature activation+3.601
the
Token the
Feature activation+4.125
1967
Token 1967
Feature activation+2.604
M
Token M
Feature activation+1.328
ide
Tokenide
Feature activation+0.016
ast
Tokenast
Feature activation+5.440
war
Token war
Feature activation+3.031
.
Token.
Feature activation+1.315
Sweet
Token Sweet
Feature activation+0.763
ening
Tokenening
Feature activation+1.308
the
Token the
Feature activation+1.039
's
Token's
Feature activation+3.005
decision
Token decision
Feature activation+1.796
to
Token to
Feature activation+3.270
set
Token set
Feature activation+3.272
up
Token up
Feature activation+3.969
added
Token added
Feature activation+5.354
security
Token security
Feature activation+2.633
measures
Token measures
Feature activation+1.385
at
Token at
Feature activation+3.593
the
Token the
Feature activation+4.026
Temple
Token Temple
Feature activation+1.638
protests
Token protests
Feature activation+2.293
against
Token against
Feature activation+4.108
Jewish
Token Jewish
Feature activation+4.563
visitors
Token visitors
Feature activation+3.206
to
Token to
Feature activation+3.954
the
Token the
Feature activation+5.316
compound
Token compound
Feature activation+2.253
and
Token and
Feature activation+4.737
Israeli
Token Israeli
Feature activation+4.913
politicians
Token politicians
Feature activation+2.657
calling
Token calling
Feature activation+3.505
for
Token for
Feature activation+3.559
560
Token 560
Feature activation+3.651
new
Token new
Feature activation+5.127
homes
Token homes
Feature activation+2.447
in
Token in
Feature activation+4.323
the
Token the
Feature activation+5.293
West
Token West
Feature activation+0.011
Bank
Token Bank
Feature activation+3.924
settlement
Token settlement
Feature activation+3.469
of
Token of
Feature activation+4.707
Ma
Token Ma
Feature activation+1.683
for
Token for
Feature activation+4.240
Jews
Token Jews
Feature activation+2.349
to
Token to
Feature activation+3.692
be
Token be
Feature activation+4.364
allowed
Token allowed
Feature activation+3.721
to
Token to
Feature activation+5.241
pray
Token pray
Feature activation+2.890
there
Token there
Feature activation+1.604
.
Token.
Feature activation+1.031
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.808
,
Token,
Feature activation+1.927
which
Token which
Feature activation+1.171
were
Token were
Feature activation+2.992
conquered
Token conquered
Feature activation+2.471
in
Token in
Feature activation+3.204
the
Token the
Feature activation+5.238
1967
Token 1967
Feature activation+2.929
war
Token war
Feature activation+2.836
.
Token.
Feature activation+0.470
The
Token The
Feature activation+2.838
British
Token British
Feature activation+2.435
truce
Token truce
Feature activation+1.721
after
Token after
Feature activation+2.264
a
Token a
Feature activation+3.423
week
Token week
Feature activation+1.557
of
Token of
Feature activation+3.811
fierce
Token fierce
Feature activation+5.166
fighting
Token fighting
Feature activation+1.771
in
Token in
Feature activation+4.773
Gaza
Token Gaza
Feature activation+2.666
.
Token.
Feature activation+0.619
Ċ
TokenĊ
Feature activation+0.000
and
Token and
Feature activation+3.937
ease
Token ease
Feature activation+4.065
security
Token security
Feature activation+2.956
restrictions
Token restrictions
Feature activation+0.926
in
Token in
Feature activation+4.230
the
Token the
Feature activation+5.162
occupied
Token occupied
Feature activation+4.575
West
Token West
Feature activation+0.352
Bank
Token Bank
Feature activation+3.259
that
Token that
Feature activation+2.774
Palestinians
Token Palestinians
Feature activation+3.400
to
Token to
Feature activation+1.871
advance
Token advance
Feature activation+2.966
plans
Token plans
Feature activation+0.973
for
Token for
Feature activation+3.559
560
Token 560
Feature activation+3.651
new
Token new
Feature activation+5.127
homes
Token homes
Feature activation+2.447
in
Token in
Feature activation+4.323
the
Token the
Feature activation+5.293
West
Token West
Feature activation+0.011
Bank
Token Bank
Feature activation+3.924
seek
Token seek
Feature activation+3.714
to
Token to
Feature activation+4.041
be
Token be
Feature activation+4.264
in
Token in
Feature activation+3.685
East
Token East
Feature activation+1.456
Jerusalem
Token Jerusalem
Feature activation+5.087
,
Token,
Feature activation+1.808
which
Token which
Feature activation+2.355
Israel
Token Israel
Feature activation+3.803
took
Token took
Feature activation+3.371
in
Token in
Feature activation+4.552
Bank
Token Bank
Feature activation+3.069
and
Token and
Feature activation+4.444
a
Token a
Feature activation+3.318
limited
Token limited
Feature activation+2.976
one
Token one
Feature activation+2.647
into
Token into
Feature activation+5.084
the
Token the
Feature activation+5.689
Gaza
Token Gaza
Feature activation+3.249
Strip
Token Strip
Feature activation+2.414
,
Token,
Feature activation+2.938
during
Token during
Feature activation+2.374
building
Token building
Feature activation+1.922
in
Token in
Feature activation+4.380
the
Token the
Feature activation+5.707
West
Token West
Feature activation+0.338
Bank
Token Bank
Feature activation+3.399
and
Token and
Feature activation+5.071
East
Token East
Feature activation+1.958
Jerusalem
Token Jerusalem
Feature activation+3.424
.
Token.
Feature activation+0.331
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.816
that
Token that
Feature activation+2.774
Palestinians
Token Palestinians
Feature activation+3.400
say
Token say
Feature activation+3.509
cripp
Token cripp
Feature activation+0.000
le
Tokenle
Feature activation+4.901
their
Token their
Feature activation+5.029
society
Token society
Feature activation+1.647
and
Token and
Feature activation+3.576
economy
Token economy
Feature activation+2.157
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
to
Token to
Feature activation+2.156
hold
Token hold
Feature activation+2.273
talks
Token talks
Feature activation+0.994
to
Token to
Feature activation+1.931
calm
Token calm
Feature activation+2.874
the
Token the
Feature activation+5.025
recent
Token recent
Feature activation+5.518
surge
Token surge
Feature activation+1.169
of
Token of
Feature activation+4.745
violence
Token violence
Feature activation+1.235
.
Token.
Feature activation+0.000
Palestinian
Token Palestinian
Feature activation+2.674
uprising
Token uprising
Feature activation+0.209
,
Token,
Feature activation+0.458
known
Token known
Feature activation+1.367
as
Token as
Feature activation+3.518
the
Token the
Feature activation+4.994
second
Token second
Feature activation+4.001
Int
Token Int
Feature activation+0.666
if
Tokenif
Feature activation+0.657
ada
Tokenada
Feature activation+1.627
.
Token.
Feature activation+0.264

Top DFA by src position
MAX = 3.004

that
Token that
Feature activation+0.028
profits
Token profits
Feature activation+0.021
from
Token from
Feature activation+0.204
settlements
Token settlements
Feature activation+0.277
built
Token built
Feature activation+0.131
on
Token on
Feature activation+3.004
occupied
Token occupied
Feature activation+1.692
Palestinian
Token Palestinian
Feature activation+0.967
and
Token and
Feature activation+0.063
Syrian
Token Syrian
Feature activation+0.155
land
Token land
Feature activation+0.000
Jerusalem
Token Jerusalem
Feature activation+0.052
,
Token,
Feature activation+0.004
which
Token which
Feature activation+0.023
Israel
Token Israel
Feature activation+0.603
took
Token took
Feature activation+0.168
in
Token in
Feature activation+2.324
the
Token the
Feature activation+0.461
1967
Token 1967
Feature activation+0.000
war
Token war
Feature activation+0.000
and
Token and
Feature activation+0.000
later
Token later
Feature activation+0.000
unless
Token unless
Feature activation+0.031
Israel
Token Israel
Feature activation+0.097
halted
Token halted
Feature activation+0.167
settlement
Token settlement
Feature activation+0.224
building
Token building
Feature activation+0.147
in
Token in
Feature activation+1.921
the
Token the
Feature activation+0.439
West
Token West
Feature activation+0.000
Bank
Token Bank
Feature activation+0.000
and
Token and
Feature activation+0.000
East
Token East
Feature activation+0.000
conducted
Token conducted
Feature activation+0.036
47
Token 47
Feature activation+0.016
inc
Token inc
Feature activation+0.039
ursions
Tokenursions
Feature activation+0.005
into
Token into
Feature activation+0.938
Palestinian
Token Palestinian
Feature activation+1.335
communities
Token communities
Feature activation+0.038
in
Token in
Feature activation+0.697
the
Token the
Feature activation+0.138
West
Token West
Feature activation+0.351
Bank
Token Bank
Feature activation+0.225
Abbas
Token Abbas
Feature activation+0.072
to
Token to
Feature activation+0.043
hold
Token hold
Feature activation+0.105
talks
Token talks
Feature activation+0.046
to
Token to
Feature activation+0.071
calm
Token calm
Feature activation+1.456
the
Token the
Feature activation+1.390
recent
Token recent
Feature activation+0.451
surge
Token surge
Feature activation+0.000
of
Token of
Feature activation+0.000
violence
Token violence
Feature activation+0.000
territories
Token territories
Feature activation+0.153
Israel
Token Israel
Feature activation+0.284
captured
Token captured
Feature activation+0.147
in
Token in
Feature activation+0.536
the
Token the
Feature activation+0.601
1967
Token 1967
Feature activation+2.240
M
Token M
Feature activation+0.273
ide
Tokenide
Feature activation+0.020
ast
Tokenast
Feature activation+0.433
war
Token war
Feature activation+0.000
.
Token.
Feature activation+0.000
Israel
Token Israel
Feature activation+0.601
's
Token's
Feature activation+0.441
decision
Token decision
Feature activation+0.097
to
Token to
Feature activation+0.363
set
Token set
Feature activation+0.651
up
Token up
Feature activation+1.362
added
Token added
Feature activation+0.134
security
Token security
Feature activation+0.000
measures
Token measures
Feature activation+0.000
at
Token at
Feature activation+0.000
the
Token the
Feature activation+0.000
increased
Token increased
Feature activation+0.012
tensions
Token tensions
Feature activation+0.041
at
Token at
Feature activation+0.198
the
Token the
Feature activation+0.137
Temple
Token Temple
Feature activation+0.602
Mount
Token Mount
Feature activation+0.704
between
Token between
Feature activation+0.133
Israel
Token Israel
Feature activation+0.458
and
Token and
Feature activation+0.103
the
Token the
Feature activation+0.065
Palestinians
Token Palestinians
Feature activation+0.646
plans
Token plans
Feature activation+0.014
for
Token for
Feature activation+0.245
560
Token 560
Feature activation+0.160
new
Token new
Feature activation+0.133
homes
Token homes
Feature activation+0.200
in
Token in
Feature activation+1.730
the
Token the
Feature activation+0.298
West
Token West
Feature activation+0.000
Bank
Token Bank
Feature activation+0.000
settlement
Token settlement
Feature activation+0.000
of
Token of
Feature activation+0.000
and
Token and
Feature activation+0.058
Israeli
Token Israeli
Feature activation+0.261
politicians
Token politicians
Feature activation+0.018
calling
Token calling
Feature activation+0.098
for
Token for
Feature activation+0.274
Jews
Token Jews
Feature activation+0.626
to
Token to
Feature activation+0.158
be
Token be
Feature activation+0.126
allowed
Token allowed
Feature activation+0.098
to
Token to
Feature activation+0.131
pray
Token pray
Feature activation+0.000
Bank
Token Bank
Feature activation+0.222
,
Token,
Feature activation+0.020
which
Token which
Feature activation+0.063
were
Token were
Feature activation+0.322
conquered
Token conquered
Feature activation+0.271
in
Token in
Feature activation+2.438
the
Token the
Feature activation+0.306
1967
Token 1967
Feature activation+0.000
war
Token war
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
which
Token which
Feature activation+0.126
are
Token are
Feature activation+0.143
observing
Token observing
Feature activation+0.344
a
Token a
Feature activation+0.196
truce
Token truce
Feature activation+0.138
after
Token after
Feature activation+1.636
a
Token a
Feature activation+0.680
week
Token week
Feature activation+0.130
of
Token of
Feature activation+0.625
fierce
Token fierce
Feature activation+0.272
fighting
Token fighting
Feature activation+0.000
settlements
Token settlements
Feature activation+0.207
and
Token and
Feature activation+0.059
ease
Token ease
Feature activation+0.122
security
Token security
Feature activation+0.017
restrictions
Token restrictions
Feature activation+0.017
in
Token in
Feature activation+1.812
the
Token the
Feature activation+0.342
occupied
Token occupied
Feature activation+0.000
West
Token West
Feature activation+0.000
Bank
Token Bank
Feature activation+0.000
that
Token that
Feature activation+0.000
this
Token this
Feature activation+0.000
week
Token week
Feature activation+0.003
to
Token to
Feature activation+0.197
advance
Token advance
Feature activation+0.462
plans
Token plans
Feature activation+0.062
for
Token for
Feature activation+1.567
560
Token 560
Feature activation+0.602
new
Token new
Feature activation+0.918
homes
Token homes
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.003
But
TokenBut
Feature activation-0.004
Palestinians
Token Palestinians
Feature activation+0.308
want
Token want
Feature activation+0.339
the
Token the
Feature activation+0.064
capital
Token capital
Feature activation+1.596
of
Token of
Feature activation+0.098
the
Token the
Feature activation+0.051
state
Token state
Feature activation+0.386
they
Token they
Feature activation+0.081
seek
Token seek
Feature activation+0.244
conducted
Token conducted
Feature activation+0.093
47
Token 47
Feature activation+0.035
inc
Token inc
Feature activation+0.077
ursions
Tokenursions
Feature activation+0.017
into
Token into
Feature activation+0.828
Palestinian
Token Palestinian
Feature activation+1.439
communities
Token communities
Feature activation+0.057
in
Token in
Feature activation+0.653
the
Token the
Feature activation+0.111
West
Token West
Feature activation+0.307
Bank
Token Bank
Feature activation+0.180
unless
Token unless
Feature activation+0.074
Israel
Token Israel
Feature activation+0.321
halted
Token halted
Feature activation+0.293
settlement
Token settlement
Feature activation+0.319
building
Token building
Feature activation+0.126
in
Token in
Feature activation+0.658
the
Token the
Feature activation+0.192
West
Token West
Feature activation+0.259
Bank
Token Bank
Feature activation+0.473
and
Token and
Feature activation+0.168
East
Token East
Feature activation+0.000
Bank
Token Bank
Feature activation+0.239
that
Token that
Feature activation+0.084
Palestinians
Token Palestinians
Feature activation+0.732
say
Token say
Feature activation+0.147
cripp
Token cripp
Feature activation-0.000
le
Tokenle
Feature activation+1.021
their
Token their
Feature activation+0.199
society
Token society
Feature activation+0.000
and
Token and
Feature activation+0.000
economy
Token economy
Feature activation+0.000
.
Token.
Feature activation+0.000
Abbas
Token Abbas
Feature activation+0.060
to
Token to
Feature activation+0.056
hold
Token hold
Feature activation+0.059
talks
Token talks
Feature activation+0.049
to
Token to
Feature activation+0.088
calm
Token calm
Feature activation+1.694
the
Token the
Feature activation+0.474
recent
Token recent
Feature activation+0.000
surge
Token surge
Feature activation+0.000
of
Token of
Feature activation+0.000
violence
Token violence
Feature activation+0.000
year
Tokenyear
Feature activation+0.007
Palestinian
Token Palestinian
Feature activation+1.074
uprising
Token uprising
Feature activation+0.003
,
Token,
Feature activation-0.004
known
Token known
Feature activation+0.026
as
Token as
Feature activation+1.457
the
Token the
Feature activation+1.070
second
Token second
Feature activation+0.000
Int
Token Int
Feature activation+0.000
if
Tokenif
Feature activation+0.000
ada
Tokenada
Feature activation+0.000

Decoder Weights Distribution

Head 0: 0.17

Head 1: 0.06

Head 2: 0.06

Head 3: 0.17

Head 4: 0.08

Head 5: 0.03

Head 6: 0.03

Head 7: 0.05

Head 8: 0.16

Head 9: 0.02

Head 10: 0.08

Head 11: 0.10

Positive logits

Palestinian 4.73

Palestinians 4.52

Palest 4.43

Gaza 4.39

Israeli 4.30

Israelis 4.05

Hamas 4.04

Jerusalem 3.96

Palestinian 3.94

Israeli 3.93

IDF 3.88

Palestine 3.88

Gaza 3.84

sett 3.73

Israel 3.67

Jewish 3.65

settlers 3.61

Heb 3.60

Israel 3.56

Zionism 3.56

Negative logits

pokemon -2.70

Brazil -2.49

mson -2.48

iami -2.44

Machina -2.40

osaurus -2.35

incinn -2.34

Bigfoot -2.30

borgh -2.29

Deadpool -2.27

Jackets -2.23

Influ -2.22

philis -2.22

Amazon -2.21

McCabe -2.20

SQL -2.19

Coil -2.18

Bucc -2.17

Ryzen -2.16

Florida -2.16
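
The INTERVAL headings listed for this feature look like equal-width bins running down from the feature's maximum activation (MAX = 6.609), each one tenth of the maximum wide. This is an inference from the printed bounds, not something the dashboard states. A minimal sketch under that assumption, where the function name `interval_edges` and the bin count of 10 are hypothetical:

```python
def interval_edges(max_activation, n_bins=10):
    """Return (low, high) bounds for equal-width bins, highest bin first.

    Assumes the bins evenly divide the range [0, max_activation]; the
    bin count is a guess based on the spacing of the printed bounds.
    """
    width = max_activation / n_bins
    return [
        (max_activation - (i + 1) * width, max_activation - i * width)
        for i in range(n_bins)
    ]


# MAX for Feature 7 as reported in the dump.
edges = interval_edges(6.609)

# Print the bounds in the same layout as the INTERVAL headings.
for low, high in edges:
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

Comparing the printed bounds with the INTERVAL headings is a quick way to check whether the equal-width assumption holds for a given feature.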

INTERVAL 5.948 - 6.609
CONTAINS 0.000%

built
Token built
Feature activation+1.400
on
Token on
Feature activation+2.945
occupied
Token occupied
Feature activation+4.053
Palestinian
Token Palestinian
Feature activation+4.452
and
Token and
Feature activation+3.665
Syrian
Token Syrian
Feature activation+6.273
land
Token land
Feature activation+1.739
in
Token in
Feature activation+3.486
violation
Token violation
Feature activation+1.277
of
Token of
Feature activation+2.519
international
Token international
Feature activation+2.857

INTERVAL 5.287 - 5.948
CONTAINS 0.000%

protests
Token protests
Feature activation+2.293
against
Token against
Feature activation+4.108
Jewish
Token Jewish
Feature activation+4.563
visitors
Token visitors
Feature activation+3.206
to
Token to
Feature activation+3.954
the
Token the
Feature activation+5.316
compound
Token compound
Feature activation+2.253
and
Token and
Feature activation+4.737
Israeli
Token Israeli
Feature activation+4.913
politicians
Token politicians
Feature activation+2.657
calling
Token calling
Feature activation+3.505
Israel
Token Israel
Feature activation+2.869
halted
Token halted
Feature activation+4.607
settlement
Token settlement
Feature activation+3.360
building
Token building
Feature activation+1.922
in
Token in
Feature activation+4.380
the
Token the
Feature activation+5.707
West
Token West
Feature activation+0.338
Bank
Token Bank
Feature activation+3.399
and
Token and
Feature activation+5.071
East
Token East
Feature activation+1.958
Jerusalem
Token Jerusalem
Feature activation+3.424
's
Token's
Feature activation+3.005
decision
Token decision
Feature activation+1.796
to
Token to
Feature activation+3.270
set
Token set
Feature activation+3.272
up
Token up
Feature activation+3.969
added
Token added
Feature activation+5.354
security
Token security
Feature activation+2.633
measures
Token measures
Feature activation+1.385
at
Token at
Feature activation+3.593
the
Token the
Feature activation+4.026
Temple
Token Temple
Feature activation+1.638
for
Token for
Feature activation+3.559
560
Token 560
Feature activation+3.651
new
Token new
Feature activation+5.127
homes
Token homes
Feature activation+2.447
in
Token in
Feature activation+4.323
the
Token the
Feature activation+5.293
West
Token West
Feature activation+0.011
Bank
Token Bank
Feature activation+3.924
settlement
Token settlement
Feature activation+3.469
of
Token of
Feature activation+4.707
Ma
Token Ma
Feature activation+1.683
,
Token,
Feature activation+1.808
which
Token which
Feature activation+2.355
Israel
Token Israel
Feature activation+3.803
took
Token took
Feature activation+3.371
in
Token in
Feature activation+4.552
the
Token the
Feature activation+5.867
1967
Token 1967
Feature activation+3.196
war
Token war
Feature activation+4.683
and
Token and
Feature activation+4.550
later
Token later
Feature activation+6.609
annexed
Token annexed
Feature activation+2.402

INTERVAL 4.626 - 5.287
CONTAINS 0.000%

Bank
Token Bank
Feature activation+3.069
and
Token and
Feature activation+4.444
a
Token a
Feature activation+3.318
limited
Token limited
Feature activation+2.976
one
Token one
Feature activation+2.647
into
Token into
Feature activation+5.084
the
Token the
Feature activation+5.689
Gaza
Token Gaza
Feature activation+3.249
Strip
Token Strip
Feature activation+2.414
,
Token,
Feature activation+2.938
during
Token during
Feature activation+2.374
Palestinian
Token Palestinian
Feature activation+2.674
uprising
Token uprising
Feature activation+0.209
,
Token,
Feature activation+0.458
known
Token known
Feature activation+1.367
as
Token as
Feature activation+3.518
the
Token the
Feature activation+4.994
second
Token second
Feature activation+4.001
Int
Token Int
Feature activation+0.666
if
Tokenif
Feature activation+0.657
ada
Tokenada
Feature activation+1.627
.
Token.
Feature activation+0.264
want
Token want
Feature activation+2.883
an
Token an
Feature activation+2.833
independent
Token independent
Feature activation+3.187
state
Token state
Feature activation+2.575
in
Token in
Feature activation+3.389
the
Token the
Feature activation+4.652
West
Token West
Feature activation+0.718
Bank
Token Bank
Feature activation+3.112
,
Token,
Feature activation+3.830
Gaza
Token Gaza
Feature activation+2.427
and
Token and
Feature activation+5.462
,
Token,
Feature activation+1.927
which
Token which
Feature activation+1.171
were
Token were
Feature activation+2.992
conquered
Token conquered
Feature activation+2.471
in
Token in
Feature activation+3.204
the
Token the
Feature activation+5.238
1967
Token 1967
Feature activation+2.929
war
Token war
Feature activation+2.836
.
Token.
Feature activation+0.470
The
Token The
Feature activation+2.838
British
Token British
Feature activation+2.435
building
Token building
Feature activation+1.922
in
Token in
Feature activation+4.380
the
Token the
Feature activation+5.707
West
Token West
Feature activation+0.338
Bank
Token Bank
Feature activation+3.399
and
Token and
Feature activation+5.071
East
Token East
Feature activation+1.958
Jerusalem
Token Jerusalem
Feature activation+3.424
.
Token.
Feature activation+0.331
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.816

INTERVAL 3.965 - 4.626
CONTAINS 0.001%

force
Token force
Feature activation+1.567
against
Token against
Feature activation+1.542
peaceful
Token peaceful
Feature activation+1.899
protests
Token protests
Feature activation+1.330
in
Token in
Feature activation+3.105
the
Token the
Feature activation+4.119
West
Token West
Feature activation+1.221
Bank
Token Bank
Feature activation+3.633
.
Token.
Feature activation+0.000
On
TokenOn
Feature activation+0.861
Friday
Token Friday
Feature activation+2.180
communities
Token communities
Feature activation+2.439
in
Token in
Feature activation+3.586
the
Token the
Feature activation+4.485
West
Token West
Feature activation+0.697
Bank
Token Bank
Feature activation+3.069
and
Token and
Feature activation+4.444
a
Token a
Feature activation+3.318
limited
Token limited
Feature activation+2.976
one
Token one
Feature activation+2.647
into
Token into
Feature activation+5.084
the
Token the
Feature activation+5.689
closed
Token closed
Feature activation+2.195
or
Token or
Feature activation+2.458
fully
Token fully
Feature activation+2.527
controlled
Token controlled
Feature activation+2.029
by
Token by
Feature activation+2.583
Israeli
Token Israeli
Feature activation+4.107
forces
Token forces
Feature activation+2.110
.
Token.
Feature activation+0.744
There
Token There
Feature activation+1.249
are
Token are
Feature activation+1.730
approximately
Token approximately
Feature activation+0.308
agreed
Token agreed
Feature activation+4.324
and
Token and
Feature activation+2.404
"
Token "
Feature activation+1.700
min
Tokenmin
Feature activation+0.144
or
Tokenor
Feature activation+1.864
"
Token"
Feature activation+4.258
land
Token land
Feature activation+2.111
swaps
Token swaps
Feature activation+1.482
between
Token between
Feature activation+2.477
the
Token the
Feature activation+3.813
Israelis
Token Israelis
Feature activation+1.660
hood
Tokenhood
Feature activation+2.369
.
Token.
Feature activation+1.156
The
Token The
Feature activation+3.289
last
Token last
Feature activation+3.500
round
Token round
Feature activation+1.857
of
Token of
Feature activation+4.389
US
Token US
Feature activation+3.564
-
Token-
Feature activation+3.581
led
Tokenled
Feature activation+4.646
peace
Token peace
Feature activation+3.892
talks
Token talks
Feature activation+1.784

INTERVAL 3.305 - 3.965
CONTAINS 0.004%

to
Token to
Feature activation+3.370
Jews
Token Jews
Feature activation+2.680
as
Token as
Feature activation+3.679
Temple
Token Temple
Feature activation+1.626
Mount
Token Mount
Feature activation+2.708
and
Token and
Feature activation+3.732
Muslims
Token Muslims
Feature activation+2.387
as
Token as
Feature activation+3.042
Haram
Token Haram
Feature activation+1.139
al
Token al
Feature activation+2.358
-
Token-
Feature activation+1.526
But
TokenBut
Feature activation+3.735
Mr
Token Mr
Feature activation+3.590
Netanyahu
Token Netanyahu
Feature activation+1.057
defended
Token defended
Feature activation+2.819
Israel
Token Israel
Feature activation+1.494
's
Token's
Feature activation+3.441
actions
Token actions
Feature activation+0.627
,
Token,
Feature activation+1.558
saying
Token saying
Feature activation+2.775
it
Token it
Feature activation+2.280
was
Token was
Feature activation+2.181
Jerusalem
Token Jerusalem
Feature activation+0.000
as
Token as
Feature activation+0.381
Israel
Token Israel
Feature activation+0.705
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.727
s
Tokens
Feature activation+3.367
capital
Token capital
Feature activation+0.117
.
Token.
Feature activation+0.000
That
Token That
Feature activation+0.688
broke
Token broke
Feature activation+0.301
with
Token with
Feature activation+0.550
between
Token between
Feature activation+2.939
the
Token the
Feature activation+3.226
different
Token different
Feature activation+3.455
parts
Token parts
Feature activation+2.371
of
Token of
Feature activation+2.964
the
Token the
Feature activation+3.433
occupied
Token occupied
Feature activation+2.567
Palestinian
Token Palestinian
Feature activation+3.250
territory
Token territory
Feature activation+2.881
remains
Token remains
Feature activation+1.235
restricted
Token restricted
Feature activation+1.312
homes
Token homes
Feature activation+2.000
following
Token following
Feature activation+2.891
Israel
Token Israel
Feature activation+1.504
's
Token's
Feature activation+4.088
establishment
Token establishment
Feature activation+2.684
in
Token in
Feature activation+3.877
1948
Token 1948
Feature activation+1.794
.
Token.
Feature activation+0.433
They
Token They
Feature activation+1.425
observed
Token observed
Feature activation+1.627
the
Token the
Feature activation+3.062

INTERVAL 2.644 - 3.305
CONTAINS 0.009%

gone
Token gone
Feature activation+0.725
on
Token on
Feature activation+0.988
violating
Token violating
Feature activation+1.604
the
Token the
Feature activation+2.479
most
Token most
Feature activation+1.879
recent
Token recent
Feature activation+3.004
cease
Token cease
Feature activation+0.317
fire
Token fire
Feature activation+1.219
even
Token even
Feature activation+0.952
more
Token more
Feature activation+0.746
brazen
Token brazen
Feature activation+0.133
UN
Token UN
Feature activation+2.023
DP
TokenDP
Feature activation+2.592
project
Token project
Feature activation+1.453
in
Token in
Feature activation+2.541
the
Token the
Feature activation+3.061
Gaza
Token Gaza
Feature activation+2.974
Strip
Token Strip
Feature activation+1.412
,
Token,
Feature activation+0.712
run
Token run
Feature activation+1.290
by
Token by
Feature activation+1.915
Hamas
Token Hamas
Feature activation+2.544
olerance
Tokenolerance
Feature activation+2.753
"
Token"
Feature activation+3.461
project
Token project
Feature activation+1.466
,
Token,
Feature activation+1.570
a
Token a
Feature activation+2.420
local
Token local
Feature activation+3.109
committee
Token committee
Feature activation+1.901
said
Token said
Feature activation+0.870
Tuesday
Token Tuesday
Feature activation+1.112
.
Token.
Feature activation+0.112
Ċ
TokenĊ
Feature activation+0.693
residential
Token residential
Feature activation+1.611
areas
Token areas
Feature activation+0.858
of
Token of
Feature activation+3.179
Raf
Token Raf
Feature activation+0.044
ah
Tokenah
Feature activation+1.876
in
Token in
Feature activation+2.661
order
Token order
Feature activation+0.192
to
Token to
Feature activation+1.932
foil
Token foil
Feature activation+1.495
the
Token the
Feature activation+2.111
capture
Token capture
Feature activation+0.848
must
Token must
Feature activation+0.799
be
Token be
Feature activation+1.657
left
Token left
Feature activation+1.400
to
Token to
Feature activation+1.847
Israeli
Token Israeli
Feature activation+3.073
-
Token-
Feature activation+2.908
Palestinian
TokenPalestinian
Feature activation+3.595
peace
Token peace
Feature activation+1.896
negotiations
Token negotiations
Feature activation+0.163
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000

INTERVAL 1.983 - 2.644
CONTAINS 0.020%

the
Token the
Feature activation+1.200
country
Token country
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+1.321
dire
Token dire
Feature activation+2.014
financial
Token financial
Feature activation+0.727
situation
Token situation
Feature activation+0.000
:
Token:
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
three
Token three
Feature activation+2.674
of
Token of
Feature activation+1.323
the
Token the
Feature activation+3.086
Palestinians
Token Palestinians
Feature activation+2.627
who
Token who
Feature activation+3.079
carried
Token carried
Feature activation+2.088
out
Token out
Feature activation+2.951
attacks
Token attacks
Feature activation+1.820
on
Token on
Feature activation+2.864
Israelis
Token Israelis
Feature activation+2.734
this
Token this
Feature activation+2.679
Sh
TokenSh
Feature activation+0.443
u
Tokenu
Feature activation+0.486
ja
Tokenja
Feature activation+0.753
âĢ
TokenâĢ
Feature activation+0.579
Ļ
TokenĻ
Feature activation+2.185
iya
Tokeniya
Feature activation+2.561
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Spring
TokenSpring
Feature activation+0.000
b
Tokenb
Feature activation+0.000
oks
Tokenoks
Feature activation+0.000
coach
Token coach
Feature activation+0.000
of
Token of
Feature activation+1.763
men
Token men
Feature activation+1.091
throwing
Token throwing
Feature activation+1.138
stones
Token stones
Feature activation+0.597
at
Token at
Feature activation+1.578
Israeli
Token Israeli
Feature activation+2.366
security
Token security
Feature activation+0.951
forces
Token forces
Feature activation+1.769
in
Token in
Feature activation+3.743
Ram
Token Ram
Feature activation+0.482
allah
Tokenallah
Feature activation+2.039
the
Token the
Feature activation+3.156
EU
Token EU
Feature activation+2.272
have
Token have
Feature activation+2.840
repeatedly
Token repeatedly
Feature activation+3.147
said
Token said
Feature activation+2.754
Israel
Token Israel
Feature activation+2.570
's
Token's
Feature activation+4.369
settlement
Token settlement
Feature activation+3.789
project
Token project
Feature activation+2.703
is
Token is
Feature activation+3.851
an
Token an
Feature activation+3.785

INTERVAL 1.322 - 1.983
CONTAINS 0.041%

two
Token two
Feature activation+2.071
Palestinians
Token Palestinians
Feature activation+1.240
were
Token were
Feature activation+1.585
injured
Token injured
Feature activation+0.584
in
Token in
Feature activation+1.433
the
Token the
Feature activation+1.861
gun
Token gun
Feature activation+0.773
fight
Tokenfight
Feature activation+0.881
,
Token,
Feature activation+1.448
Ag
Token Ag
Feature activation+0.115
ence
Tokenence
Feature activation+0.183
soon
Token soon
Feature activation+0.000
as
Token as
Feature activation+1.074
they
Token they
Feature activation+0.690
have
Token have
Feature activation+0.537
submitted
Token submitted
Feature activation+0.882
their
Token their
Feature activation+1.621
application
Token application
Feature activation+0.214
,
Token,
Feature activation+0.260
arguing
Token arguing
Feature activation+1.007
that
Token that
Feature activation+1.223
even
Token even
Feature activation+0.912
were
Token were
Feature activation+0.551
unlikely
Token unlikely
Feature activation+0.614
to
Token to
Feature activation+0.559
raise
Token raise
Feature activation+1.259
objections
Token objections
Feature activation+0.089
to
Token to
Feature activation+1.401
a
Token a
Feature activation+1.995
compromise
Token compromise
Feature activation+1.306
offer
Token offer
Feature activation+0.050
being
Token being
Feature activation+0.731
advanced
Token advanced
Feature activation+0.000
some
Token some
Feature activation+0.000
200
Token 200
Feature activation+0.000
meters
Token meters
Feature activation+0.674
east
Token east
Feature activation+0.000
of
Token of
Feature activation+1.182
the
Token the
Feature activation+1.740
Green
Token Green
Feature activation+0.245
Line
Token Line
Feature activation+1.320
,
Token,
Feature activation+0.755
not
Token not
Feature activation+0.000
far
Token far
Feature activation+0.147
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+2.330
s
Tokens
Feature activation+1.909
home
Token home
Feature activation+1.559
in
Token in
Feature activation+1.336
the
Token the
Feature activation+1.664
south
Token south
Feature activation+1.869
Heb
Token Heb
Feature activation+0.000
ron
Tokenron
Feature activation+2.805
Hills
Token Hills
Feature activation+3.331
village
Token village
Feature activation+1.454

INTERVAL 0.661 - 1.322
CONTAINS 0.081%

in
Token in
Feature activation+1.368
plain
Token plain
Feature activation+1.553
view
Token view
Feature activation+0.382
of
Token of
Feature activation+0.811
the
Token the
Feature activation+1.611
medical
Token medical
Feature activation+0.898
team
Token team
Feature activation+0.308
after
Token after
Feature activation+0.033
he
Token he
Feature activation+1.039
had
Token had
Feature activation+0.898
already
Token already
Feature activation+1.061
May
Token May
Feature activation+1.072
2010
Token 2010
Feature activation+0.296
,
Token,
Feature activation+0.156
when
Token when
Feature activation+1.152
Israeli
Token Israeli
Feature activation+2.822
command
Token command
Feature activation+1.090
os
Tokenos
Feature activation+3.909
killed
Token killed
Feature activation+1.916
nine
Token nine
Feature activation+2.721
Turkish
Token Turkish
Feature activation+2.592
citizens
Token citizens
Feature activation+2.212
vacuum
Token vacuum
Feature activation+0.000
and
Token and
Feature activation+0.359
emerge
Token emerge
Feature activation+0.332
with
Token with
Feature activation+0.229
a
Token a
Feature activation+0.184
new
Token new
Feature activation+0.728
leader
Token leader
Feature activation+0.000
lacking
Token lacking
Feature activation+0.366
any
Token any
Feature activation+0.000
credibility
Token credibility
Feature activation+0.000
.
Token.
Feature activation+0.000
Syrian
Token Syrian
Feature activation+0.802
territory
Token territory
Feature activation+0.755
--
Token --
Feature activation+0.695
again
Token again
Feature activation+0.373
,
Token,
Feature activation+0.600
possibly
Token possibly
Feature activation+0.796
widening
Token widening
Feature activation+1.287
the
Token the
Feature activation+1.583
Middle
Token Middle
Feature activation+0.182
East
Token East
Feature activation+2.082
conflict
Token conflict
Feature activation+1.272
as
Token as
Feature activation+1.848
rocket
Token rocket
Feature activation+0.957
strikes
Token strikes
Feature activation+0.695
continue
Token continue
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.755
Justice
TokenJustice
Feature activation+0.612
Minister
Token Minister
Feature activation+1.769
T
Token T
Feature activation+1.376
zip
Tokenzip
Feature activation+0.390
i
Tokeni
Feature activation+0.207

INTERVAL 0.000 - 0.661
CONTAINS 99.844%

Reserve
Token Reserve
Feature activation+0.000
University
Token University
Feature activation+0.000
have
Token have
Feature activation+0.000
announced
Token announced
Feature activation+0.000
safe
Token safe
Feature activation+0.000
spaces
Token spaces
Feature activation+0.000
to
Token to
Feature activation+0.000
protect
Token protect
Feature activation+0.000
students
Token students
Feature activation+0.000
from
Token from
Feature activation+0.000
unwelcome
Token unwelcome
Feature activation+0.000
equivalent
Token equivalent
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
electricity
Token electricity
Feature activation+0.000
usage
Token usage
Feature activation+0.000
of
Token of
Feature activation+0.000
400
Token 400
Feature activation+0.000
homes
Token homes
Feature activation+0.000
or
Token or
Feature activation+0.000
the
Token the
Feature activation+0.000
fuel
Token fuel
Feature activation+0.000
empires
Token empires
Feature activation+0.000
.
Token.
Feature activation+0.000
Did
Token Did
Feature activation+0.000
you
Token you
Feature activation+0.000
hear
Token hear
Feature activation+0.000
about
Token about
Feature activation+0.000
the
Token the
Feature activation+0.000
guy
Token guy
Feature activation+0.000
who
Token who
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
alarm
Token alarm
Feature activation+0.000
went
Token went
Feature activation+0.000
off
Token off
Feature activation+0.000
as
Token as
Feature activation+0.000
the
Token the
Feature activation+0.000
prisoners
Token prisoners
Feature activation+0.000
went
Token went
Feature activation+0.000
back
Token back
Feature activation+0.000
after
Token after
Feature activation+0.000
dinner
Token dinner
Feature activation+0.000
.
Token.
Feature activation+0.000
âĢ¦
TokenâĢ¦
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
(
Token (
Feature activation+0.000
See
TokenSee
Feature activation+0.000
Figure
Token Figure
Feature activation+0.000
B
Token B
Feature activation+0.000
.)
Token.)
Feature activation+0.000
The
Token The
Feature activation+0.000
rhetorical
Token rhetorical
Feature activation+0.000
free
Token free
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 8: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.054

whatever
Token whatever
Feature activation-0.725
policy
Token policy
Feature activation-0.099
prescriptions
Token prescriptions
Feature activation-0.445
that
Token that
Feature activation-0.301
we
Token we
Feature activation-0.041
've
Token've
Feature activation+0.049
been
Token been
Feature activation-0.222
proposing
Token proposing
Feature activation-0.123
don
Token don
Feature activation-0.109
't
Token't
Feature activation-0.390
reach
Token reach
Feature activation-2.528
<|endoftext|>
Token<|endoftext|>
Feature activation-6.246
perspective
Token perspective
Feature activation-0.099
.
Token.
Feature activation-0.082
"
Token "
Feature activation+0.076
What
TokenWhat
Feature activation-0.521
is
Token is
Feature activation-0.316
true
Token true
Feature activation-0.696
,
Token,
Feature activation-0.026
though
Token though
Feature activation-0.290
prescriptions
Token prescriptions
Feature activation-0.084
that
Token that
Feature activation-0.155
we
Token we
Feature activation+0.031
've
Token've
Feature activation-0.038
been
Token been
Feature activation-0.126
proposing
Token proposing
Feature activation+0.270
don
Token don
Feature activation-1.057
't
Token't
Feature activation-0.469
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.534
policy
Token policy
Feature activation-0.403
prescriptions
Token prescriptions
Feature activation-0.341
that
Token that
Feature activation-0.109
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.035
been
Token been
Feature activation-0.110
proposing
Token proposing
Feature activation-0.175
don
Token don
Feature activation-0.167
't
Token't
Feature activation-0.171
reach
Token reach
Feature activation-1.563
.
Token.
Feature activation-0.554
"
Token "
Feature activation-0.012
What
TokenWhat
Feature activation-0.385
is
Token is
Feature activation-0.222
true
Token true
Feature activation-0.928
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.476
perspective
Token perspective
Feature activation-0.254
.
Token.
Feature activation-0.428
"
Token "
Feature activation+0.578
What
TokenWhat
Feature activation-0.403
is
Token is
Feature activation-0.554
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.944
perspective
Token perspective
Feature activation-0.524
.
Token.
Feature activation-0.201
"
Token "
Feature activation+1.054
What
TokenWhat
Feature activation-0.946
is
Token is
Feature activation-0.682
true
Token true
Feature activation-1.723
,
Token,
Feature activation-0.284
though
Token though
Feature activation+0.000
What
TokenWhat
Feature activation-0.605
is
Token is
Feature activation-0.389
true
Token true
Feature activation-0.126
,
Token,
Feature activation-0.235
though
Token though
Feature activation-1.586
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.286
perspective
Token perspective
Feature activation-0.037
.
Token.
Feature activation-0.022
"
Token "
Feature activation+0.086
What
TokenWhat
Feature activation-0.116
is
Token is
Feature activation-0.238
true
Token true
Feature activation-0.213
,
Token,
Feature activation+0.006
though
Token though
Feature activation-0.145
that
Token that
Feature activation-0.718
whatever
Token whatever
Feature activation-0.489
policy
Token policy
Feature activation-0.050
prescriptions
Token prescriptions
Feature activation-0.582
that
Token that
Feature activation-0.470
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.838
perspective
Token perspective
Feature activation-0.150
.
Token.
Feature activation-0.278
"
Token "
Feature activation-0.154
What
TokenWhat
Feature activation+0.076
is
Token is
Feature activation-0.459
true
Token true
Feature activation-0.585
,
Token,
Feature activation-0.050
though
Token though
Feature activation-0.327
,
Token,
Feature activation-0.197
though
Token though
Feature activation-0.440
,
Token,
Feature activation-0.288
is
Token is
Feature activation-0.352
that
Token that
Feature activation-0.906
whatever
Token whatever
Feature activation-0.813
policy
Token policy
Feature activation+0.062
prescriptions
Token prescriptions
Feature activation-0.536
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation-0.728
that
Token that
Feature activation-0.427
we
Token we
Feature activation-0.014
've
Token've
Feature activation-0.218
been
Token been
Feature activation-0.599
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation-0.828
whatever
Token whatever
Feature activation-0.404
policy
Token policy
Feature activation-0.270
prescriptions
Token prescriptions
Feature activation-0.699
that
Token that
Feature activation-0.490
we
Token we
Feature activation+0.056
've
Token've
Feature activation-0.464
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation-0.161
that
Token that
Feature activation-0.326
we
Token we
Feature activation-0.096
've
Token've
Feature activation-0.052
been
Token been
Feature activation-0.157
proposing
Token proposing
Feature activation+0.459
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.230
policy
Token policy
Feature activation-0.086
prescriptions
Token prescriptions
Feature activation-0.262
that
Token that
Feature activation-0.079
we
Token we
Feature activation-0.009
've
Token've
Feature activation+0.070
been
Token been
Feature activation-0.035
proposing
Token proposing
Feature activation-0.047
don
Token don
Feature activation-6.103
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.309
perspective
Token perspective
Feature activation-0.059
.
Token.
Feature activation-0.230
"
Token "
Feature activation+0.006
What
TokenWhat
Feature activation-0.323
is
Token is
Feature activation-0.489
true
Token true
Feature activation-0.982
,
Token,
Feature activation-0.137
though
Token though
Feature activation-0.302
<|endoftext|>
Token<|endoftext|>
Feature activation-7.375
perspective
Token perspective
Feature activation-0.130
.
Token.
Feature activation-0.248
"
Token "
Feature activation+0.389
What
TokenWhat
Feature activation-0.429
is
Token is
Feature activation-0.348
true
Token true
Feature activation-0.804
,
Token,
Feature activation-0.188
though
Token though
Feature activation-0.432
is
Token is
Feature activation-0.376
true
Token true
Feature activation-0.471
,
Token,
Feature activation-0.122
though
Token though
Feature activation-1.142
,
Token,
Feature activation-0.710
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.259
perspective
Token perspective
Feature activation-0.108
.
Token.
Feature activation-0.430
"
Token "
Feature activation+0.079
What
TokenWhat
Feature activation-0.365
is
Token is
Feature activation-0.294
true
Token true
Feature activation-1.488
,
Token,
Feature activation-0.139
though
Token though
Feature activation-0.739

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.08

Head 2: 0.08

Head 3: 0.08

Head 4: 0.07

Head 5: 0.08

Head 6: 0.08

Head 7: 0.08

Head 8: 0.09

Head 9: 0.09

Head 10: 0.09

Head 11: 0.09

Positive logits

alysed              3.05
osp                 2.92
dstg                2.86
usalem              2.72
natureconservancy   2.71
elight              2.68
archive             2.68
luaj                2.67
osphere             2.67
cffffcc             2.64
SpaceEngineers      2.60
EStreamFrame        2.59
quickShipAvailable  2.58
reon                2.56
atorium             2.55
leans               2.54
Writ                2.52
hiba                2.50
thouse              2.49
*/(                 2.47

Negative logits

dart      -3.01
Tay       -2.80
deserve   -2.70
Trap      -2.64
rounding  -2.53
FUL       -2.50
grade     -2.49
Laz       -2.48
par       -2.46
await     -2.46
Aber      -2.45
incons    -2.45
Malone    -2.40
disg      -2.40
darts     -2.35
Annie     -2.35
Lag       -2.33
outweigh  -2.31
todd      -2.29
Barkley   -2.28

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

other
Token other
Feature activation+0.000
IM
Token IM
Feature activation+0.000
services
Token services
Feature activation+0.000
that
Token that
Feature activation+0.000
distribute
Token distribute
Feature activation+0.000
messages
Token messages
Feature activation+0.000
through
Token through
Feature activation+0.000
central
Token central
Feature activation+0.000
servers
Token servers
Feature activation+0.000
which
Token which
Feature activation+0.000
could
Token could
Feature activation+0.000
43
Token 43
Feature activation+0.000
,
Token,
Feature activation+0.000
was
Token was
Feature activation+0.000
gunned
Token gunned
Feature activation+0.000
down
Token down
Feature activation+0.000
by
Token by
Feature activation+0.000
Officer
Token Officer
Feature activation+0.000
Brent
Token Brent
Feature activation+0.000
ley
Tokenley
Feature activation+0.000
V
Token V
Feature activation+0.000
inson
Tokeninson
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
economic
Token economic
Feature activation+0.000
activities
Token activities
Feature activation+0.000
on
Token on
Feature activation+0.000
the
Token the
Feature activation+0.000
territories
Token territories
Feature activation+0.000
of
Token of
Feature activation+0.000
third
Token third
Feature activation+0.000
states
Token states
Feature activation+0.000
.
Token.
Feature activation+0.000
among
Token among
Feature activation+0.000
other
Token other
Feature activation+0.000
GOP
Token GOP
Feature activation+0.000
factions
Token factions
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
hard
Token hard
Feature activation+0.000
-
Token-
Feature activation+0.000
right
Tokenright
Feature activation+0.000
Freedom
Token Freedom
Feature activation+0.000
Caucus
Token Caucus
Feature activation+0.000
vers
Tokenvers
Feature activation+0.000
,
Token,
Feature activation+0.000
mountains
Token mountains
Feature activation+0.000
,
Token,
Feature activation+0.000
forests
Token forests
Feature activation+0.000
and
Token and
Feature activation+0.000
all
Token all
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 9: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.609

been
Token been
Feature activation-0.044
proposing
Token proposing
Feature activation+0.074
don
Token don
Feature activation-0.052
't
Token't
Feature activation-0.002
reach
Token reach
Feature activation-1.107
,
Token,
Feature activation+0.198
are
Token are
Feature activation-0.070
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
been
Token been
Feature activation-0.049
proposing
Token proposing
Feature activation-0.052
don
Token don
Feature activation-0.016
't
Token't
Feature activation-0.008
reach
Token reach
Feature activation-0.639
,
Token,
Feature activation+0.302
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation-0.016
that
Token that
Feature activation+0.014
we
Token we
Feature activation-0.069
've
Token've
Feature activation-0.006
been
Token been
Feature activation-0.073
proposing
Token proposing
Feature activation+0.232
don
Token don
Feature activation-0.186
't
Token't
Feature activation+0.027
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
we
Token we
Feature activation-0.032
've
Token've
Feature activation+0.017
been
Token been
Feature activation-0.029
proposing
Token proposing
Feature activation-0.057
don
Token don
Feature activation-0.024
't
Token't
Feature activation+0.080
reach
Token reach
Feature activation-0.648
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
perspective
Token perspective
Feature activation-0.042
.
Token.
Feature activation-0.044
"
Token "
Feature activation-0.046
What
TokenWhat
Feature activation-0.013
is
Token is
Feature activation+0.248
true
Token true
Feature activation+0.260
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-2.643
perspective
Token perspective
Feature activation+0.006
.
Token.
Feature activation+0.015
"
Token "
Feature activation+0.150
What
TokenWhat
Feature activation-0.012
is
Token is
Feature activation+0.146
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-2.223
perspective
Token perspective
Feature activation-0.093
.
Token.
Feature activation+0.094
"
Token "
Feature activation+0.414
What
TokenWhat
Feature activation-0.153
is
Token is
Feature activation-0.067
true
Token true
Feature activation-0.281
,
Token,
Feature activation-0.318
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.056
.
Token.
Feature activation+0.141
"
Token "
Feature activation-1.294
What
TokenWhat
Feature activation-0.132
is
Token is
Feature activation-0.008
true
Token true
Feature activation+0.609
,
Token,
Feature activation-0.303
though
Token though
Feature activation-0.716
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.006
is
Token is
Feature activation+0.023
that
Token that
Feature activation-0.255
whatever
Token whatever
Feature activation-0.132
policy
Token policy
Feature activation-0.143
prescriptions
Token prescriptions
Feature activation+0.111
that
Token that
Feature activation-0.136
we
Token we
Feature activation-0.265
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
is
Token is
Feature activation+0.005
true
Token true
Feature activation+0.003
,
Token,
Feature activation-0.080
though
Token though
Feature activation-0.141
,
Token,
Feature activation-0.051
is
Token is
Feature activation+0.059
that
Token that
Feature activation-0.293
whatever
Token whatever
Feature activation-0.208
policy
Token policy
Feature activation-0.322
prescriptions
Token prescriptions
Feature activation-0.177
that
Token that
Feature activation-0.077
<|endoftext|>
Token<|endoftext|>
Feature activation-2.491
perspective
Token perspective
Feature activation-0.039
.
Token.
Feature activation+0.024
"
Token "
Feature activation-0.048
What
TokenWhat
Feature activation+0.031
is
Token is
Feature activation-0.052
true
Token true
Feature activation-0.007
,
Token,
Feature activation-0.049
though
Token though
Feature activation-0.133
,
Token,
Feature activation-0.047
<|endoftext|>
Token<|endoftext|>
Feature activation-2.120
perspective
Token perspective
Feature activation-0.023
.
Token.
Feature activation+0.055
"
Token "
Feature activation-0.007
What
TokenWhat
Feature activation+0.021
is
Token is
Feature activation-0.009
true
Token true
Feature activation-0.143
,
Token,
Feature activation-0.171
,
Token,
Feature activation-0.015
is
Token is
Feature activation+0.033
that
Token that
Feature activation-0.175
whatever
Token whatever
Feature activation-0.140
policy
Token policy
Feature activation-0.106
prescriptions
Token prescriptions
Feature activation+0.286
that
Token that
Feature activation-0.079
we
Token we
Feature activation-0.250
've
Token've
Feature activation-0.099
been
Token been
Feature activation-0.171
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.030
is
Token is
Feature activation+0.024
that
Token that
Feature activation-0.219
whatever
Token whatever
Feature activation-0.159
policy
Token policy
Feature activation-0.138
prescriptions
Token prescriptions
Feature activation+0.261
that
Token that
Feature activation-0.151
we
Token we
Feature activation-0.273
've
Token've
Feature activation-0.174
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation-0.007
that
Token that
Feature activation-0.098
we
Token we
Feature activation-0.082
've
Token've
Feature activation-0.007
been
Token been
Feature activation-0.068
proposing
Token proposing
Feature activation+0.351
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.079
policy
Token policy
Feature activation-0.015
prescriptions
Token prescriptions
Feature activation-0.112
that
Token that
Feature activation-0.025
we
Token we
Feature activation-0.011
've
Token've
Feature activation+0.030
been
Token been
Feature activation-0.007
proposing
Token proposing
Feature activation-0.002
don
Token don
Feature activation-1.012
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
is
Token is
Feature activation-0.022
true
Token true
Feature activation-0.062
,
Token,
Feature activation-0.153
though
Token though
Feature activation-0.140
,
Token,
Feature activation-0.080
is
Token is
Feature activation+0.096
that
Token that
Feature activation-0.269
whatever
Token whatever
Feature activation-0.296
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
is
Token is
Feature activation+0.052
true
Token true
Feature activation-0.065
,
Token,
Feature activation-0.221
though
Token though
Feature activation-0.184
,
Token,
Feature activation-0.075
is
Token is
Feature activation+0.193
that
Token that
Feature activation-0.320
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
perspective
Token perspective
Feature activation-0.061
.
Token.
Feature activation+0.033
"
Token "
Feature activation-0.059
What
TokenWhat
Feature activation-0.133
is
Token is
Feature activation-0.024
true
Token true
Feature activation+0.540
,
Token,
Feature activation-0.248
though
Token though
Feature activation-0.530
,
Token,
Feature activation-0.379
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-2.320
perspective
Token perspective
Feature activation+0.010
.
Token.
Feature activation+0.019
"
Token "
Feature activation-0.182
What
TokenWhat
Feature activation-0.034
is
Token is
Feature activation+0.040
true
Token true
Feature activation-0.209
,
Token,
Feature activation-0.166
though
Token though
Feature activation-0.328
,
Token,
Feature activation-0.176
is
Token is
Feature activation-0.003

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.07

Head 2: 0.08

Head 3: 0.08

Head 4: 0.07

Head 5: 0.08

Head 6: 0.09

Head 7: 0.08

Head 8: 0.10

Head 9: 0.10

Head 10: 0.08

Head 11: 0.09

Positive logits

MpServer  3.27

Vendor  3.06

beetles  2.96

olor  2.91

beetle  2.91

onite  2.89

Borough  2.86

rodent  2.80

bery  2.80

aceae  2.78

76561  2.77

Agency  2.72

Commissioner  2.69

boro  2.68

manufact  2.67

bryce  2.67

Veter  2.65

cinnamon  2.65

pigeon  2.62

untary  2.58

Negative logits

ㅋㅋ  -2.82

nos  -2.79

-2.75

-2.73

icial  -2.70

-2.70

Cu  -2.67

-2.60

Na  -2.55

-2.54

iqu  -2.52

Sloven  -2.50

Ge  -2.49

Hur  -2.49

Stone  -2.48

equ  -2.47

Yan  -2.46

Yuan  -2.44

-2.43

[/  -2.43

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

became
Token became
Feature activation+0.000
one
Token one
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
most
Token most
Feature activation+0.000
acclaimed
Token acclaimed
Feature activation+0.000
indie
Token indie
Feature activation+0.000
games
Token games
Feature activation+0.000
out
Token out
Feature activation+0.000
there
Token there
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
***
Token***
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
V
TokenV
Feature activation+0.000
ampire
Tokenampire
Feature activation+0.000
!
Token!
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
the
Token the
Feature activation+0.000
Irish
Token Irish
Feature activation+0.000
artist
Token artist
Feature activation+0.000
Robert
Token Robert
Feature activation+0.000
Barker
Token Barker
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Visual
TokenVisual
Feature activation+0.000
ise
Tokenise
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Hindi
Token Hindi
Feature activation+0.000
and
Token and
Feature activation+0.000
other
Token other
Feature activation+0.000
languages
Token languages
Feature activation+0.000
d
Token d
Feature activation+0.000
ents
Tokenents
Feature activation+0.000
their
Token their
Feature activation+0.000
job
Token job
Feature activation+0.000
prospects
Token prospects
Feature activation+0.000
both
Token both
Feature activation+0.000
in
Token in
Feature activation+0.000
Molecular
Token Molecular
Feature activation+0.000
nan
Token nan
Feature activation+0.000
otechnology
Tokenotechnology
Feature activation+0.000
is
Token is
Feature activation+0.000
om
Token om
Feature activation+0.000
ni
Tokenni
Feature activation+0.000
-
Token-
Feature activation+0.000
present
Tokenpresent
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
novel
Token novel
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 10: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.660

been
Token been
Feature activation-0.104
proposing
Token proposing
Feature activation-0.191
don
Token don
Feature activation-0.060
't
Token't
Feature activation-0.215
reach
Token reach
Feature activation-0.425
,
Token,
Feature activation+0.095
are
Token are
Feature activation-0.110
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
been
Token been
Feature activation-0.109
proposing
Token proposing
Feature activation-0.141
don
Token don
Feature activation-0.101
't
Token't
Feature activation-0.201
reach
Token reach
Feature activation-0.195
,
Token,
Feature activation+0.109
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.041
that
Token that
Feature activation+0.037
whatever
Token whatever
Feature activation+0.083
policy
Token policy
Feature activation-0.076
prescriptions
Token prescriptions
Feature activation-0.078
that
Token that
Feature activation+0.097
we
Token we
Feature activation-0.024
've
Token've
Feature activation-0.071
been
Token been
Feature activation-0.068
proposing
Token proposing
Feature activation-0.047
don
Token don
Feature activation-0.300
true
Token true
Feature activation-0.018
,
Token,
Feature activation-0.027
though
Token though
Feature activation-0.125
,
Token,
Feature activation+0.008
is
Token is
Feature activation+0.056
that
Token that
Feature activation+0.265
whatever
Token whatever
Feature activation+0.061
policy
Token policy
Feature activation-0.097
prescriptions
Token prescriptions
Feature activation-0.175
that
Token that
Feature activation+0.051
we
Token we
Feature activation-0.056
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation-0.221
"
Token "
Feature activation+0.127
What
TokenWhat
Feature activation+0.369
is
Token is
Feature activation+0.403
true
Token true
Feature activation+0.419
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.867
perspective
Token perspective
Feature activation+0.022
.
Token.
Feature activation-0.159
"
Token "
Feature activation+0.561
What
TokenWhat
Feature activation+0.321
is
Token is
Feature activation+0.265
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.731
perspective
Token perspective
Feature activation-0.025
.
Token.
Feature activation-0.098
"
Token "
Feature activation+0.637
What
TokenWhat
Feature activation+0.014
is
Token is
Feature activation+0.112
true
Token true
Feature activation+0.054
,
Token,
Feature activation-0.019
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation-0.430
"
Token "
Feature activation-1.176
What
TokenWhat
Feature activation+0.200
is
Token is
Feature activation+0.075
true
Token true
Feature activation+0.547
,
Token,
Feature activation+0.002
though
Token though
Feature activation-0.526
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.932
perspective
Token perspective
Feature activation+0.019
.
Token.
Feature activation+0.051
"
Token "
Feature activation+0.141
What
TokenWhat
Feature activation+0.033
is
Token is
Feature activation+0.056
true
Token true
Feature activation+0.048
,
Token,
Feature activation+0.001
though
Token though
Feature activation-0.047
true
Token true
Feature activation+0.053
,
Token,
Feature activation-0.016
though
Token though
Feature activation-0.111
,
Token,
Feature activation+0.005
is
Token is
Feature activation+0.089
that
Token that
Feature activation+0.139
whatever
Token whatever
Feature activation+0.113
policy
Token policy
Feature activation-0.326
prescriptions
Token prescriptions
Feature activation-0.359
that
Token that
Feature activation+0.131
we
Token we
Feature activation+0.000
,
Token,
Feature activation-0.006
though
Token though
Feature activation-0.106
,
Token,
Feature activation+0.009
is
Token is
Feature activation+0.036
that
Token that
Feature activation+0.247
whatever
Token whatever
Feature activation+0.291
policy
Token policy
Feature activation-0.333
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation-0.035
,
Token,
Feature activation-0.013
though
Token though
Feature activation-0.150
,
Token,
Feature activation+0.002
is
Token is
Feature activation+0.048
that
Token that
Feature activation+0.302
whatever
Token whatever
Feature activation+0.263
policy
Token policy
Feature activation-0.233
prescriptions
Token prescriptions
Feature activation-0.163
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
,
Token,
Feature activation+0.008
is
Token is
Feature activation+0.034
that
Token that
Feature activation-0.005
whatever
Token whatever
Feature activation+0.107
policy
Token policy
Feature activation-0.003
prescriptions
Token prescriptions
Feature activation+0.166
that
Token that
Feature activation+0.054
we
Token we
Feature activation-0.120
've
Token've
Feature activation-0.262
been
Token been
Feature activation-0.153
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.022
that
Token that
Feature activation+0.066
whatever
Token whatever
Feature activation+0.120
policy
Token policy
Feature activation+0.059
prescriptions
Token prescriptions
Feature activation+0.177
that
Token that
Feature activation+0.057
we
Token we
Feature activation-0.058
've
Token've
Feature activation-0.223
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
is
Token is
Feature activation-0.011
that
Token that
Feature activation+0.045
whatever
Token whatever
Feature activation+0.109
policy
Token policy
Feature activation+0.058
prescriptions
Token prescriptions
Feature activation-0.019
that
Token that
Feature activation+0.129
we
Token we
Feature activation-0.059
've
Token've
Feature activation-0.083
been
Token been
Feature activation-0.079
proposing
Token proposing
Feature activation-0.056
don
Token don
Feature activation+0.000
.
Token.
Feature activation-0.044
"
Token "
Feature activation-0.031
What
TokenWhat
Feature activation-0.018
is
Token is
Feature activation-0.004
true
Token true
Feature activation-0.018
,
Token,
Feature activation+0.008
though
Token though
Feature activation-0.036
,
Token,
Feature activation+0.003
is
Token is
Feature activation-0.024
that
Token that
Feature activation-0.018
whatever
Token whatever
Feature activation-0.027
true
Token true
Feature activation+0.083
,
Token,
Feature activation+0.006
though
Token though
Feature activation-0.102
,
Token,
Feature activation+0.025
is
Token is
Feature activation+0.167
that
Token that
Feature activation+0.243
whatever
Token whatever
Feature activation+0.126
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.801
perspective
Token perspective
Feature activation+0.014
.
Token.
Feature activation-0.102
"
Token "
Feature activation+0.319
What
TokenWhat
Feature activation-0.004
is
Token is
Feature activation+0.123
true
Token true
Feature activation+0.065
,
Token,
Feature activation-0.039
though
Token though
Feature activation-0.153
perspective
Token perspective
Feature activation-0.113
.
Token.
Feature activation-0.179
"
Token "
Feature activation+0.005
What
TokenWhat
Feature activation+0.238
is
Token is
Feature activation+0.157
true
Token true
Feature activation+0.660
,
Token,
Feature activation-0.012
though
Token though
Feature activation-0.364
,
Token,
Feature activation+0.093
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.810
perspective
Token perspective
Feature activation+0.001
.
Token.
Feature activation-0.286
"
Token "
Feature activation-0.032
What
TokenWhat
Feature activation+0.105
is
Token is
Feature activation+0.167
true
Token true
Feature activation+0.119
,
Token,
Feature activation+0.032
though
Token though
Feature activation-0.250
,
Token,
Feature activation+0.041
is
Token is
Feature activation+0.076

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.07

Head 3: 0.08

Head 4: 0.08

Head 5: 0.06

Head 6: 0.08

Head 7: 0.09

Head 8: 0.10

Head 9: 0.08

Head 10: 0.08

Head 11: 0.10

Positive logits

ABV: 5.05

inventoryQuantity: 3.72

wcsstore: 3.63

IPA: 3.38

soDeliveryDate: 3.24

3.20

Beer: 3.11

brewed: 3.11

rencies: 3.10

teasp: 3.08

vomit: 3.05

hops: 2.98

cohol: 2.96

terness: 2.96

renheit: 2.90

brew: 2.88

��: 2.78

dayName: 2.77

chlor: 2.75

Poké: 2.69

Negative logits

responsiveness: -2.70

responsive: -2.59

accommod: -2.54

corrective: -2.53

Recomm: -2.48

yon: -2.39

accommodating: -2.35

rehearsal: -2.32

supportive: -2.31

Downs: -2.31

Lion: -2.25

determin: -2.23

oug: -2.22

responsive: -2.19

Work: -2.17

Sketch: -2.17

Response: -2.14

elastic: -2.11

cutter: -2.10

Stretch: -2.10

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

of
Token of
Feature activation+0.000
morality
Token morality
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Yet
TokenYet
Feature activation+0.000
new
Token new
Feature activation+0.000
research
Token research
Feature activation+0.000
show
Token show
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
as
Token as
Feature activation+0.000
all
Token all
Feature activation+0.000
questions
Token questions
Feature activation+0.000
are
Token are
Feature activation+0.000
fair
Token fair
Feature activation+0.000
game
Token game
Feature activation+0.000
for
Token for
Feature activation+0.000
any
Token any
Feature activation+0.000
segment
Token segment
Feature activation+0.000
.
Token.
Feature activation+0.000
This
Token This
Feature activation+0.000
.
Token.
Feature activation+0.000
Michaels
Token Michaels
Feature activation+0.000
can
Token can
Feature activation+0.000
pull
Token pull
Feature activation+0.000
off
Token off
Feature activation+0.000
incredible
Token incredible
Feature activation+0.000
high
Token high
Feature activation+0.000
spots
Token spots
Feature activation+0.000
,
Token,
Feature activation+0.000
grapple
Token grapple
Feature activation+0.000
with
Token with
Feature activation+0.000
s
Tokens
Feature activation+0.000
Big
Token Big
Feature activation+0.000
Pap
Token Pap
Feature activation+0.000
i
Tokeni
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
that
Token that
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
still
Token still
Feature activation+0.000
aspects
Token aspects
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
latter
Token latter
Feature activation+0.000
).
Token).
Feature activation+0.000
This
Token This
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
novel
Token novel
Feature activation+0.000
myst
Token myst
Feature activation+0.000
ifying
Tokenifying
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 11: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.884

<|endoftext|>
Token<|endoftext|>
Feature activation-3.985
perspective
Token perspective
Feature activation-0.042
.
Token.
Feature activation-0.055
"
Token "
Feature activation+0.080
What
TokenWhat
Feature activation-0.084
is
Token is
Feature activation-0.130
true
Token true
Feature activation-0.565
,
Token,
Feature activation+0.040
though
Token though
Feature activation-0.063
<|endoftext|>
Token<|endoftext|>
Feature activation-3.674
perspective
Token perspective
Feature activation-0.006
.
Token.
Feature activation+0.017
"
Token "
Feature activation+0.158
What
TokenWhat
Feature activation-0.299
is
Token is
Feature activation-0.134
true
Token true
Feature activation-0.309
,
Token,
Feature activation+0.070
though
Token though
Feature activation-0.135
prescriptions
Token prescriptions
Feature activation+0.005
that
Token that
Feature activation-0.031
we
Token we
Feature activation-0.006
've
Token've
Feature activation-0.046
been
Token been
Feature activation-0.130
proposing
Token proposing
Feature activation+0.147
don
Token don
Feature activation-0.385
't
Token't
Feature activation-0.188
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.372
,
Token,
Feature activation+0.005
though
Token though
Feature activation-0.164
,
Token,
Feature activation+0.050
is
Token is
Feature activation+0.097
that
Token that
Feature activation+0.351
whatever
Token whatever
Feature activation-0.307
policy
Token policy
Feature activation-0.367
prescriptions
Token prescriptions
Feature activation-0.218
that
Token that
Feature activation-0.003
we
Token we
Feature activation+0.036
<|endoftext|>
Token<|endoftext|>
Feature activation-4.938
perspective
Token perspective
Feature activation-0.068
.
Token.
Feature activation-0.035
"
Token "
Feature activation+0.329
What
TokenWhat
Feature activation-0.157
is
Token is
Feature activation+0.080
true
Token true
Feature activation-0.286
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.030
perspective
Token perspective
Feature activation-0.025
.
Token.
Feature activation+0.016
"
Token "
Feature activation+0.717
What
TokenWhat
Feature activation-0.090
is
Token is
Feature activation-0.032
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.023
perspective
Token perspective
Feature activation-0.172
.
Token.
Feature activation+0.114
"
Token "
Feature activation+0.884
What
TokenWhat
Feature activation-0.469
is
Token is
Feature activation-0.224
true
Token true
Feature activation-0.861
,
Token,
Feature activation+0.081
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.594
perspective
Token perspective
Feature activation-0.129
.
Token.
Feature activation+0.105
"
Token "
Feature activation-0.587
What
TokenWhat
Feature activation-0.219
is
Token is
Feature activation-0.118
true
Token true
Feature activation-0.170
,
Token,
Feature activation+0.064
<|endoftext|>
Token<|endoftext|>
Feature activation-5.252
perspective
Token perspective
Feature activation+0.002
.
Token.
Feature activation+0.002
"
Token "
Feature activation+0.100
What
TokenWhat
Feature activation-0.059
is
Token is
Feature activation-0.081
true
Token true
Feature activation-0.027
,
Token,
Feature activation+0.047
though
Token though
Feature activation-0.064
<|endoftext|>
Token<|endoftext|>
Feature activation-4.561
perspective
Token perspective
Feature activation-0.019
.
Token.
Feature activation-0.020
"
Token "
Feature activation+0.197
What
TokenWhat
Feature activation-0.043
is
Token is
Feature activation-0.069
true
Token true
Feature activation-0.105
,
Token,
Feature activation+0.048
though
Token though
Feature activation-0.151
true
Token true
Feature activation-0.123
,
Token,
Feature activation+0.054
though
Token though
Feature activation-0.136
,
Token,
Feature activation+0.004
is
Token is
Feature activation+0.071
that
Token that
Feature activation+0.238
whatever
Token whatever
Feature activation-0.231
policy
Token policy
Feature activation+0.137
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
true
Token true
Feature activation-0.356
,
Token,
Feature activation+0.007
though
Token though
Feature activation-0.195
,
Token,
Feature activation-0.052
is
Token is
Feature activation+0.074
that
Token that
Feature activation+0.440
whatever
Token whatever
Feature activation-0.458
policy
Token policy
Feature activation+0.077
prescriptions
Token prescriptions
Feature activation-0.354
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
is
Token is
Feature activation-0.040
true
Token true
Feature activation-0.028
,
Token,
Feature activation+0.014
though
Token though
Feature activation-0.051
,
Token,
Feature activation+0.022
is
Token is
Feature activation+0.076
that
Token that
Feature activation-0.044
whatever
Token whatever
Feature activation-0.108
policy
Token policy
Feature activation-0.101
prescriptions
Token prescriptions
Feature activation-0.500
that
Token that
Feature activation-0.114
is
Token is
Feature activation-0.054
true
Token true
Feature activation-0.033
,
Token,
Feature activation+0.020
though
Token though
Feature activation-0.081
,
Token,
Feature activation+0.006
is
Token is
Feature activation+0.054
that
Token that
Feature activation-0.058
whatever
Token whatever
Feature activation-0.121
policy
Token policy
Feature activation-0.090
prescriptions
Token prescriptions
Feature activation-0.539
that
Token that
Feature activation-0.129
prescriptions
Token prescriptions
Feature activation-0.071
that
Token that
Feature activation-0.051
we
Token we
Feature activation-0.079
've
Token've
Feature activation-0.040
been
Token been
Feature activation-0.150
proposing
Token proposing
Feature activation+0.330
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.203
policy
Token policy
Feature activation-0.020
prescriptions
Token prescriptions
Feature activation-0.187
that
Token that
Feature activation-0.040
we
Token we
Feature activation-0.003
've
Token've
Feature activation+0.059
been
Token been
Feature activation-0.027
proposing
Token proposing
Feature activation-0.015
don
Token don
Feature activation-2.508
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.314
,
Token,
Feature activation+0.054
though
Token though
Feature activation-0.143
,
Token,
Feature activation+0.020
is
Token is
Feature activation+0.212
that
Token that
Feature activation+0.373
whatever
Token whatever
Feature activation-0.344
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.443
perspective
Token perspective
Feature activation+0.002
.
Token.
Feature activation+0.044
"
Token "
Feature activation+0.482
What
TokenWhat
Feature activation-0.118
is
Token is
Feature activation-0.049
true
Token true
Feature activation-0.295
,
Token,
Feature activation+0.032
though
Token though
Feature activation-0.198
<|endoftext|>
Token<|endoftext|>
Feature activation-4.311
perspective
Token perspective
Feature activation-0.225
.
Token.
Feature activation+0.010
"
Token "
Feature activation+0.243
What
TokenWhat
Feature activation-0.321
is
Token is
Feature activation-0.150
true
Token true
Feature activation-0.088
,
Token,
Feature activation+0.078
though
Token though
Feature activation-0.545
<|endoftext|>
Token<|endoftext|>
Feature activation-4.306
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation+0.037
"
Token "
Feature activation+0.226
What
TokenWhat
Feature activation-0.190
is
Token is
Feature activation-0.065
true
Token true
Feature activation-0.698
,
Token,
Feature activation+0.103
though
Token though
Feature activation-0.356

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.09

Head 3: 0.08

Head 4: 0.08

Head 5: 0.08

Head 6: 0.08

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.08

Head 11: 0.10

Positive logits

mot: 2.71

ebus: 2.65

ocratic: 2.62

Helpful: 2.57

abouts: 2.55

privileges: 2.52

cong: 2.51

conflic: 2.49

Geneva: 2.47

recurring: 2.45

Campaign: 2.42

discrep: 2.41

ous: 2.38

bipartisan: 2.37

inconsistent: 2.36

iances: 2.36

habitual: 2.35

Priv: 2.35

Guest: 2.34

affili: 2.33

Negative logits

NXT: -3.03

Merchants: -2.88

Sears: -2.86

lite: -2.85

sculpture: -2.84

Thorn: -2.83

Lak: -2.81

NRL: -2.75

Daly: -2.69

Sass: -2.66

HM: -2.65

UST: -2.63

lenders: -2.63

Shapiro: -2.60

heid: -2.58

SL: -2.57

Sutherland: -2.57

Slave: -2.55

Barker: -2.55

Mick: -2.55

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

ilot
Tokenilot
Feature activation+0.000
we
Token we
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
ve
Tokenve
Feature activation+0.000
continuously
Token continuously
Feature activation+0.000
educated
Token educated
Feature activation+0.000
customers
Token customers
Feature activation+0.000
on
Token on
Feature activation+0.000
the
Token the
Feature activation+0.000
use
Token use
Feature activation+0.000
in
Token in
Feature activation+0.000
a
Token a
Feature activation+0.000
way
Token way
Feature activation+0.000
that
Token that
Feature activation+0.000
I
Token I
Feature activation+0.000
couldn
Token couldn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
enjoy
Token enjoy
Feature activation+0.000
my
Token my
Feature activation+0.000
Qatar
Token Qatar
Feature activation+0.000
,
Token,
Feature activation+0.000
in
Token in
Feature activation+0.000
Cairo
Token Cairo
Feature activation+0.000
,
Token,
Feature activation+0.000
Egypt
Token Egypt
Feature activation+0.000
,
Token,
Feature activation+0.000
July
Token July
Feature activation+0.000
5
Token 5
Feature activation+0.000
,
Token,
Feature activation+0.000
2017
Token 2017
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
In
TokenIn
Feature activation+0.000
order
Token order
Feature activation+0.000
to
Token to
Feature activation+0.000
escape
Token escape
Feature activation+0.000
from
Token from
Feature activation+0.000
this
Token this
Feature activation+0.000
endless
Token endless
Feature activation+0.000
time
Token time
Feature activation+0.000
loop
Token loop
Feature activation+0.000
,
Token,
Feature activation+0.000
despite
Token despite
Feature activation+0.000
Mr
Token Mr
Feature activation+0.000
.
Token.
Feature activation+0.000
Trump
Token Trump
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
executive
Token executive
Feature activation+0.000
order
Token order
Feature activation+0.000
freezing
Token freezing
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 12: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.772

<|endoftext|>
Token<|endoftext|>
Feature activation-5.479
perspective
Token perspective
Feature activation-0.083
.
Token.
Feature activation-0.137
"
Token "
Feature activation+0.040
What
TokenWhat
Feature activation-0.165
is
Token is
Feature activation-0.293
true
Token true
Feature activation-1.044
,
Token,
Feature activation-0.138
though
Token though
Feature activation-0.112
<|endoftext|>
Token<|endoftext|>
Feature activation-5.114
perspective
Token perspective
Feature activation-0.027
.
Token.
Feature activation-0.063
"
Token "
Feature activation+0.106
What
TokenWhat
Feature activation-0.428
is
Token is
Feature activation-0.264
true
Token true
Feature activation-0.563
,
Token,
Feature activation-0.185
though
Token though
Feature activation-0.225
prescriptions
Token prescriptions
Feature activation+0.149
that
Token that
Feature activation-0.017
we
Token we
Feature activation-0.134
've
Token've
Feature activation-0.081
been
Token been
Feature activation-0.147
proposing
Token proposing
Feature activation+0.244
don
Token don
Feature activation-1.409
't
Token't
Feature activation-0.393
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
is
Token is
Feature activation-0.098
that
Token that
Feature activation+0.014
whatever
Token whatever
Feature activation-0.312
policy
Token policy
Feature activation-0.344
prescriptions
Token prescriptions
Feature activation-0.094
that
Token that
Feature activation+0.035
we
Token we
Feature activation-0.034
've
Token've
Feature activation+0.019
been
Token been
Feature activation-0.092
proposing
Token proposing
Feature activation-0.144
don
Token don
Feature activation-0.200
<|endoftext|>
Token<|endoftext|>
Feature activation-6.930
perspective
Token perspective
Feature activation-0.142
.
Token.
Feature activation-0.152
"
Token "
Feature activation+0.243
What
TokenWhat
Feature activation-0.221
is
Token is
Feature activation-0.503
true
Token true
Feature activation-0.643
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.081
perspective
Token perspective
Feature activation-0.111
.
Token.
Feature activation-0.211
"
Token "
Feature activation+0.522
What
TokenWhat
Feature activation-0.209
is
Token is
Feature activation-0.577
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.692
perspective
Token perspective
Feature activation-0.330
.
Token.
Feature activation-0.076
"
Token "
Feature activation+0.772
What
TokenWhat
Feature activation-0.674
is
Token is
Feature activation-0.639
true
Token true
Feature activation-1.728
,
Token,
Feature activation-0.460
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.371
perspective
Token perspective
Feature activation-0.192
.
Token.
Feature activation+0.400
"
Token "
Feature activation+0.231
What
TokenWhat
Feature activation-0.281
is
Token is
Feature activation-0.348
true
Token true
Feature activation-0.364
,
Token,
Feature activation-0.280
,
Token,
Feature activation-0.098
is
Token is
Feature activation-0.183
that
Token that
Feature activation-0.628
whatever
Token whatever
Feature activation-0.209
policy
Token policy
Feature activation-0.316
prescriptions
Token prescriptions
Feature activation+0.030
that
Token that
Feature activation-0.290
we
Token we
Feature activation-0.171
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.311
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation-0.114
"
Token "
Feature activation+0.133
What
TokenWhat
Feature activation-0.142
is
Token is
Feature activation-0.290
true
Token true
Feature activation-0.229
,
Token,
Feature activation-0.186
though
Token though
Feature activation-0.279
<|endoftext|>
Token<|endoftext|>
Feature activation-6.747
perspective
Token perspective
Feature activation-0.057
.
Token.
Feature activation-0.190
"
Token "
Feature activation+0.059
What
TokenWhat
Feature activation-0.174
is
Token is
Feature activation-0.454
true
Token true
Feature activation-0.285
,
Token,
Feature activation-0.176
though
Token though
Feature activation-0.261
<|endoftext|>
Token<|endoftext|>
Feature activation-5.644
perspective
Token perspective
Feature activation-0.042
.
Token.
Feature activation-0.214
"
Token "
Feature activation+0.189
What
TokenWhat
Feature activation-0.172
is
Token is
Feature activation-0.365
true
Token true
Feature activation-0.545
,
Token,
Feature activation-0.222
though
Token though
Feature activation-0.374
,
Token,
Feature activation-0.071
is
Token is
Feature activation-0.105
that
Token that
Feature activation-0.459
whatever
Token whatever
Feature activation-0.194
policy
Token policy
Feature activation-0.198
prescriptions
Token prescriptions
Feature activation+0.258
that
Token that
Feature activation-0.183
we
Token we
Feature activation-0.199
've
Token've
Feature activation-0.206
been
Token been
Feature activation-0.888
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.056
is
Token is
Feature activation-0.113
that
Token that
Feature activation-0.569
whatever
Token whatever
Feature activation-0.204
policy
Token policy
Feature activation-0.167
prescriptions
Token prescriptions
Feature activation+0.324
that
Token that
Feature activation-0.157
we
Token we
Feature activation+0.001
've
Token've
Feature activation-0.459
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.248
that
Token that
Feature activation-0.009
we
Token we
Feature activation-0.119
've
Token've
Feature activation-0.099
been
Token been
Feature activation-0.192
proposing
Token proposing
Feature activation+0.378
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.213
policy
Token policy
Feature activation-0.040
prescriptions
Token prescriptions
Feature activation-0.250
that
Token that
Feature activation-0.060
we
Token we
Feature activation-0.009
've
Token've
Feature activation+0.070
been
Token been
Feature activation-0.040
proposing
Token proposing
Feature activation-0.022
don
Token don
Feature activation-9.601
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.229
perspective
Token perspective
Feature activation+0.027
.
Token.
Feature activation-0.122
"
Token "
Feature activation+0.172
What
TokenWhat
Feature activation-0.150
is
Token is
Feature activation-0.446
true
Token true
Feature activation-0.636
,
Token,
Feature activation-0.257
though
Token though
Feature activation-0.251
<|endoftext|>
Token<|endoftext|>
Feature activation-6.173
perspective
Token perspective
Feature activation-0.045
.
Token.
Feature activation-0.077
"
Token "
Feature activation+0.408
What
TokenWhat
Feature activation-0.219
is
Token is
Feature activation-0.278
true
Token true
Feature activation-0.605
,
Token,
Feature activation-0.282
though
Token though
Feature activation-0.356
<|endoftext|>
Token<|endoftext|>
Feature activation-5.985
perspective
Token perspective
Feature activation-0.279
.
Token.
Feature activation-0.096
"
Token "
Feature activation+0.212
What
TokenWhat
Feature activation-0.352
is
Token is
Feature activation-0.403
true
Token true
Feature activation-0.779
,
Token,
Feature activation-0.310
though
Token though
Feature activation-0.936
<|endoftext|>
Token<|endoftext|>
Feature activation-5.991
perspective
Token perspective
Feature activation-0.085
.
Token.
Feature activation+0.113
"
Token "
Feature activation+0.361
What
TokenWhat
Feature activation-0.318
is
Token is
Feature activation-0.364
true
Token true
Feature activation-1.451
,
Token,
Feature activation-0.347
though
Token though
Feature activation-0.621

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.08

Head 2: 0.09

Head 3: 0.08

Head 4: 0.09

Head 5: 0.09

Head 6: 0.09

Head 7: 0.06

Head 8: 0.09

Head 9: 0.07

Head 10: 0.09

Head 11: 0.09

Positive logits

seiz: 2.80

2.78

condem: 2.72

2.71

atform: 2.67

ovan: 2.62

gyn: 2.61

2.52

filament: 2.50

Shib: 2.50

Hib: 2.47

Ramadan: 2.46

ende: 2.45

2.44

Nish: 2.40

Blu: 2.39

dim: 2.39

whit: 2.38

2.38

Paraly: 2.36

Negative logits

Savannah: -2.99

addock: -2.70

Queens: -2.66

driving: -2.52

ricks: -2.41

ighters: -2.39

eston: -2.36

explode: -2.35

Aust: -2.35

iments: -2.31

imental: -2.29

prototype: -2.27

ACS: -2.27

etc: -2.25

assault: -2.20

Socialism: -2.18

tsky: -2.18

driver: -2.17

smash: -2.15

rian: -2.14

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

----------------------------------------------------------------
Token----------------------------------------------------------------
Feature activation+0.000
-------
Token-------
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
25
Token25
Feature activation+0.000
/
Token/
Feature activation+0.000
12
Token12
Feature activation+0.000
/
Token/
Feature activation+0.000
15
Token15
Feature activation+0.000
-
Token -
Feature activation+0.000
Last
Token Last
Feature activation+0.000
ask
Token ask
Feature activation+0.000
for
Token for
Feature activation+0.000
your
Token your
Feature activation+0.000
understanding
Token understanding
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
This
TokenThis
Feature activation+0.000
all
Token all
Feature activation+0.000
happened
Token happened
Feature activation+0.000
to
Token to
Feature activation+0.000
William
Token William
Feature activation+0.000
B
Token B
Feature activation+0.000
.
Token.
Feature activation+0.000
Davis
Token Davis
Feature activation+0.000
,
Token,
Feature activation+0.000
click
Token click
Feature activation+0.000
the
Token the
Feature activation+0.000
audio
Token audio
Feature activation+0.000
labelled
Token labelled
Feature activation+0.000
:
Token:
Feature activation+0.000
The
Token The
Feature activation+0.000
lighting
Token lighting
Feature activation+0.000
of
Token of
Feature activation+0.000
candles
Token candles
Feature activation+0.000
on
Token on
Feature activation+0.000
Friday
Token Friday
Feature activation+0.000
nights
Token nights
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
So
TokenSo
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
brought
Token brought
Feature activation+0.000
on
Token on
Feature activation+0.000
a
Token a
Feature activation+0.000
heart
Token heart
Feature activation+0.000
attack
Token attack
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ĺ
Tokenĺ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 13: Ultra low frequency cluster

TOP ACTIVATIONS
MAX = 0.368

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.368
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 17.036

the
Token the
Feature activation-0.030
team
Token team
Feature activation-0.009
has
Token has
Feature activation-0.000
signed
Token signed
Feature activation-0.009
forward
Token forward
Feature activation+0.017
Tyson
Token Tyson
Feature activation+17.036
J
Token J
Feature activation-0.161
ost
Tokenost
Feature activation-0.263
(
Token (
Feature activation-0.028
J
TokenJ
Feature activation-0.060
OH
TokenOH
Feature activation-0.062
<|endoftext|>
Token<|endoftext|>
Feature activation-6.376
perspective
Token perspective
Feature activation-0.058
.
Token.
Feature activation-0.001
"
Token "
Feature activation+0.268
What
TokenWhat
Feature activation-0.604
is
Token is
Feature activation-0.258
true
Token true
Feature activation-0.517
,
Token,
Feature activation+0.068
though
Token though
Feature activation-0.304
whatever
Token whatever
Feature activation-0.222
policy
Token policy
Feature activation-0.073
prescriptions
Token prescriptions
Feature activation-0.303
that
Token that
Feature activation-0.082
we
Token we
Feature activation-0.009
've
Token've
Feature activation+0.075
been
Token been
Feature activation-0.030
proposing
Token proposing
Feature activation-0.033
don
Token don
Feature activation-3.669
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.678
,
Token,
Feature activation+0.033
though
Token though
Feature activation-0.453
,
Token,
Feature activation-0.058
is
Token is
Feature activation-0.010
that
Token that
Feature activation+0.362
whatever
Token whatever
Feature activation-0.535
policy
Token policy
Feature activation-0.464
prescriptions
Token prescriptions
Feature activation-0.329
that
Token that
Feature activation-0.096
we
Token we
Feature activation+0.028
<|endoftext|>
Token<|endoftext|>
Feature activation-8.554
perspective
Token perspective
Feature activation-0.153
.
Token.
Feature activation-0.211
"
Token "
Feature activation+0.552
What
TokenWhat
Feature activation-0.324
is
Token is
Feature activation-0.466
true
Token true
Feature activation-0.665
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.809
perspective
Token perspective
Feature activation-0.113
.
Token.
Feature activation-0.092
"
Token "
Feature activation+1.348
What
TokenWhat
Feature activation-0.303
is
Token is
Feature activation-0.523
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.187
perspective
Token perspective
Feature activation-0.337
.
Token.
Feature activation+0.116
"
Token "
Feature activation+1.701
What
TokenWhat
Feature activation-1.008
is
Token is
Feature activation-0.612
true
Token true
Feature activation-1.530
,
Token,
Feature activation+0.103
though
Token though
Feature activation+0.000
whatever
Token whatever
Feature activation-0.439
policy
Token policy
Feature activation-0.169
prescriptions
Token prescriptions
Feature activation+0.012
that
Token that
Feature activation-0.132
we
Token we
Feature activation+0.079
've
Token've
Feature activation+0.114
been
Token been
Feature activation-0.039
proposing
Token proposing
Feature activation+0.032
don
Token don
Feature activation-0.711
't
Token't
Feature activation-0.448
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation-0.100
is
Token is
Feature activation-0.047
that
Token that
Feature activation-0.404
whatever
Token whatever
Feature activation-0.341
policy
Token policy
Feature activation-0.131
prescriptions
Token prescriptions
Feature activation+0.735
that
Token that
Feature activation-0.402
we
Token we
Feature activation+0.054
've
Token've
Feature activation-0.381
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
true
Token true
Feature activation-0.575
,
Token,
Feature activation+0.007
though
Token though
Feature activation-0.534
,
Token,
Feature activation-0.189
is
Token is
Feature activation-0.085
that
Token that
Feature activation+0.136
whatever
Token whatever
Feature activation-0.859
policy
Token policy
Feature activation-0.077
prescriptions
Token prescriptions
Feature activation-0.129
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.576
perspective
Token perspective
Feature activation-0.017
.
Token.
Feature activation-0.023
"
Token "
Feature activation+0.384
What
TokenWhat
Feature activation-0.307
is
Token is
Feature activation-0.374
true
Token true
Feature activation-0.719
,
Token,
Feature activation+0.083
though
Token though
Feature activation-0.357
<|endoftext|>
Token<|endoftext|>
Feature activation-8.153
perspective
Token perspective
Feature activation-0.071
.
Token.
Feature activation-0.159
"
Token "
Feature activation+0.126
What
TokenWhat
Feature activation-0.038
is
Token is
Feature activation-0.407
true
Token true
Feature activation-0.404
,
Token,
Feature activation+0.036
though
Token though
Feature activation-0.379
,
Token,
Feature activation-0.091
is
Token is
Feature activation+0.017
that
Token that
Feature activation-0.332
whatever
Token whatever
Feature activation-0.301
policy
Token policy
Feature activation-0.203
prescriptions
Token prescriptions
Feature activation+0.649
that
Token that
Feature activation-0.297
we
Token we
Feature activation+0.028
've
Token've
Feature activation+0.095
been
Token been
Feature activation-0.577
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.096
is
Token is
Feature activation-0.080
that
Token that
Feature activation-0.435
whatever
Token whatever
Feature activation-0.280
policy
Token policy
Feature activation-0.312
prescriptions
Token prescriptions
Feature activation+0.421
that
Token that
Feature activation-0.419
we
Token we
Feature activation-0.111
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.117
is
Token is
Feature activation-0.193
that
Token that
Feature activation-0.722
whatever
Token whatever
Feature activation-0.623
policy
Token policy
Feature activation+0.183
prescriptions
Token prescriptions
Feature activation+0.332
that
Token that
Feature activation-0.289
we
Token we
Feature activation-0.133
've
Token've
Feature activation+0.116
been
Token been
Feature activation-0.067
proposing
Token proposing
Feature activation+0.083
<|endoftext|>
Token<|endoftext|>
Feature activation-7.310
perspective
Token perspective
Feature activation-0.202
.
Token.
Feature activation-0.094
"
Token "
Feature activation+0.311
What
TokenWhat
Feature activation-0.792
is
Token is
Feature activation-0.421
true
Token true
Feature activation-0.286
,
Token,
Feature activation+0.086
though
Token though
Feature activation-1.544
<|endoftext|>
Token<|endoftext|>
Feature activation-7.608
perspective
Token perspective
Feature activation-0.052
.
Token.
Feature activation-0.045
"
Token "
Feature activation+0.302
What
TokenWhat
Feature activation-0.143
is
Token is
Feature activation-0.253
true
Token true
Feature activation-0.292
,
Token,
Feature activation+0.017
though
Token though
Feature activation-0.402
<|endoftext|>
Token<|endoftext|>
Feature activation-7.613
perspective
Token perspective
Feature activation-0.095
.
Token.
Feature activation-0.012
"
Token "
Feature activation+0.833
What
TokenWhat
Feature activation-0.384
is
Token is
Feature activation-0.216
true
Token true
Feature activation-0.516
,
Token,
Feature activation+0.036
though
Token though
Feature activation-0.535
.
Token.
Feature activation+0.094
"
Token "
Feature activation-0.310
What
TokenWhat
Feature activation-0.503
is
Token is
Feature activation-0.279
true
Token true
Feature activation+0.146
,
Token,
Feature activation+0.179
though
Token though
Feature activation-1.966
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.430
perspective
Token perspective
Feature activation-0.091
.
Token.
Feature activation-0.019
"
Token "
Feature activation+0.582
What
TokenWhat
Feature activation-0.387
is
Token is
Feature activation-0.283
true
Token true
Feature activation-1.292
,
Token,
Feature activation+0.122
though
Token though
Feature activation-0.942

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.09

Head 2: 0.08

Head 3: 0.08

Head 4: 0.08

Head 5: 0.09

Head 6: 0.08

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.09

Head 11: 0.09

Positive logits

zos 3.09

ipeg 3.06

Berks 2.97

Franks 2.94

Muslims 2.74

Kund 2.67

comed 2.66

Quran 2.58

Jewish 2.54

mus 2.50

RANT 2.49

Services 2.48

Koran 2.47

caut 2.39

paramedics 2.39

Muslim 2.38

Colorado 2.34

Pesh 2.34

NYC 2.33

onian 2.32

Negative logits

stub -2.54

Unlock -2.44

hall -2.42

-2.37

Brawl -2.35

.」 -2.32

disag -2.31

adium -2.29

weeds -2.26

complicate -2.25

yrights -2.24

dur -2.24

arenas -2.22

Ish -2.22

-2.20

Yam -2.19

unfold -2.18

BALL -2.18

arena -2.18

Struct -2.16

INTERVAL 0.331 - 0.368
CONTAINS 0.000%

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.368
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000

INTERVAL 0.294 - 0.331
CONTAINS 0.000%

INTERVAL 0.257 - 0.294
CONTAINS 0.000%

INTERVAL 0.221 - 0.257
CONTAINS 0.000%

INTERVAL 0.184 - 0.221
CONTAINS 0.000%

INTERVAL 0.147 - 0.184
CONTAINS 0.000%

INTERVAL 0.110 - 0.147
CONTAINS 0.000%

INTERVAL 0.074 - 0.110
CONTAINS 0.000%

INTERVAL 0.037 - 0.074
CONTAINS 0.000%

INTERVAL 0.000 - 0.037
CONTAINS 100.000%

detectable
Token detectable
Feature activation+0.000
virus
Token virus
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
and
Token and
Feature activation+0.000
daily
Token daily
Feature activation+0.000
drugs
Token drugs
Feature activation+0.000
âĢĵ
Token âĢĵ
Feature activation+0.000
thanks
Token thanks
Feature activation+0.000
to
Token to
Feature activation+0.000
a
Token a
Feature activation+0.000
new
Token new
Feature activation+0.000
cas
Token cas
Feature activation+0.000
hews
Tokenhews
Feature activation+0.000
?
Token?
Feature activation+0.000
You
Token You
Feature activation+0.000
should
Token should
Feature activation+0.000
try
Token try
Feature activation+0.000
my
Token my
Feature activation+0.000
Cas
Token Cas
Feature activation+0.000
hew
Tokenhew
Feature activation+0.000
Che
Token Che
Feature activation+0.000
es
Tokenes
Feature activation+0.000
hop
Token hop
Feature activation+0.000
heads
Tokenheads
Feature activation+0.000
all
Token all
Feature activation+0.000
along
Token along
Feature activation+0.000
Lake
Token Lake
Feature activation+0.000
Michigan
Token Michigan
Feature activation+0.000
:
Token:
Feature activation+0.000
instead
Token instead
Feature activation+0.000
of
Token of
Feature activation+0.000
simply
Token simply
Feature activation+0.000
being
Token being
Feature activation+0.000
government
Token government
Feature activation+0.000
to
Token to
Feature activation+0.000
help
Token help
Feature activation+0.000
them
Token them
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
In
TokenIn
Feature activation+0.000
March
Token March
Feature activation+0.000
,
Token,
Feature activation+0.000
one
Token one
Feature activation+0.000
anything
Token anything
Feature activation+0.000
,
Token,
Feature activation+0.000
so
Token so
Feature activation+0.000
if
Token if
Feature activation+0.000
we
Token we
Feature activation+0.000
could
Token could
Feature activation+0.000
speak
Token speak
Feature activation+0.000
a
Token a
Feature activation+0.000
little
Token little
Feature activation+0.000
bit
Token bit
Feature activation+0.000
more
Token more
Feature activation+0.000
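
The INTERVAL / CONTAINS rows above read like an equal-width histogram over the range from 0 to MAX, listed from the highest interval down. The sketch below shows one way such rows could be produced. It is not taken from the tool that generated this page: the function names `interval_report` and `format_report`, the choice of ten bins, and the assumption that activations are non-negative and non-empty are all hypothetical.

```python
from typing import List, Tuple

Row = Tuple[float, float, float]  # (interval low, interval high, percent of samples)


def interval_report(activations: List[float], n_bins: int = 10) -> List[Row]:
    """Bin non-negative activations into equal-width intervals over [0, max].

    Rows are returned highest interval first. Assumes a non-empty input.
    """
    max_act = max(activations)
    if max_act == 0.0:
        # A feature that never fires collapses to a single 0 - 0 interval.
        return [(0.0, 0.0, 100.0)]

    width = max_act / n_bins
    counts = [0] * n_bins
    for a in activations:
        # Clamp so the maximum value lands in the top bin, not past it.
        idx = min(int(a / width), n_bins - 1)
        counts[idx] += 1

    total = len(activations)
    return [
        (i * width, (i + 1) * width, 100.0 * counts[i] / total)
        for i in reversed(range(n_bins))
    ]


def format_report(rows: List[Row]) -> str:
    """Render rows in the INTERVAL / CONTAINS layout used on this page."""
    lines = []
    for low, high, pct in rows:
        lines.append(f"INTERVAL {low:.3f} - {high:.3f}")
        lines.append(f"CONTAINS {pct:.3f}%")
    return "\n".join(lines)
```

Under these assumptions, a feature with a single non-zero activation among a very large number of samples would report nearly everything in the lowest interval, and a feature whose maximum is 0 would report one `0.000 - 0.000` interval containing 100.000%.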

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 14: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.021

been
Token been
Feature activation-0.038
proposing
Token proposing
Feature activation-0.062
don
Token don
Feature activation-0.043
't
Token't
Feature activation-0.302
reach
Token reach
Feature activation-0.819
,
Token,
Feature activation+0.320
are
Token are
Feature activation-0.198
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
been
Token been
Feature activation-0.034
proposing
Token proposing
Feature activation-0.043
don
Token don
Feature activation-0.023
't
Token't
Feature activation-0.302
reach
Token reach
Feature activation-0.520
,
Token,
Feature activation+0.454
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation-0.051
whatever
Token whatever
Feature activation-0.122
policy
Token policy
Feature activation-0.075
prescriptions
Token prescriptions
Feature activation+0.052
that
Token that
Feature activation-0.011
we
Token we
Feature activation+0.141
've
Token've
Feature activation+0.013
been
Token been
Feature activation+0.006
proposing
Token proposing
Feature activation+0.098
don
Token don
Feature activation-0.190
't
Token't
Feature activation-0.333
true
Token true
Feature activation-0.122
,
Token,
Feature activation-0.025
though
Token though
Feature activation-0.135
,
Token,
Feature activation+0.016
is
Token is
Feature activation+0.017
that
Token that
Feature activation+0.279
whatever
Token whatever
Feature activation-0.115
policy
Token policy
Feature activation-0.187
prescriptions
Token prescriptions
Feature activation-0.073
that
Token that
Feature activation+0.004
we
Token we
Feature activation+0.040
<|endoftext|>
Token<|endoftext|>
Feature activation-1.167
perspective
Token perspective
Feature activation+0.056
.
Token.
Feature activation-0.089
"
Token "
Feature activation+0.363
What
TokenWhat
Feature activation+0.362
is
Token is
Feature activation+0.180
true
Token true
Feature activation+0.306
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.214
perspective
Token perspective
Feature activation+0.085
.
Token.
Feature activation+0.017
"
Token "
Feature activation+0.862
What
TokenWhat
Feature activation+0.368
is
Token is
Feature activation+0.022
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-0.969
perspective
Token perspective
Feature activation+0.046
.
Token.
Feature activation+0.106
"
Token "
Feature activation+1.021
What
TokenWhat
Feature activation+0.066
is
Token is
Feature activation-0.114
true
Token true
Feature activation-0.027
,
Token,
Feature activation+0.004
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation+0.052
.
Token.
Feature activation-0.113
"
Token "
Feature activation-0.392
What
TokenWhat
Feature activation+0.251
is
Token is
Feature activation-0.056
true
Token true
Feature activation+0.903
,
Token,
Feature activation-0.040
though
Token though
Feature activation-0.472
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.367
perspective
Token perspective
Feature activation+0.019
.
Token.
Feature activation+0.054
"
Token "
Feature activation+0.160
What
TokenWhat
Feature activation+0.051
is
Token is
Feature activation-0.029
true
Token true
Feature activation+0.001
,
Token,
Feature activation+0.034
though
Token though
Feature activation-0.055
<|endoftext|>
Token<|endoftext|>
Feature activation-1.137
perspective
Token perspective
Feature activation+0.020
.
Token.
Feature activation+0.003
"
Token "
Feature activation+0.278
What
TokenWhat
Feature activation+0.130
is
Token is
Feature activation-0.017
true
Token true
Feature activation-0.026
,
Token,
Feature activation+0.017
though
Token though
Feature activation-0.109
<|endoftext|>
Token<|endoftext|>
Feature activation-1.186
perspective
Token perspective
Feature activation+0.004
.
Token.
Feature activation-0.008
"
Token "
Feature activation+0.199
What
TokenWhat
Feature activation+0.231
is
Token is
Feature activation-0.099
true
Token true
Feature activation-0.066
,
Token,
Feature activation+0.030
though
Token though
Feature activation-0.124
,
Token,
Feature activation-0.002
<|endoftext|>
Token<|endoftext|>
Feature activation-1.061
perspective
Token perspective
Feature activation+0.007
.
Token.
Feature activation-0.027
"
Token "
Feature activation+0.341
What
TokenWhat
Feature activation+0.188
is
Token is
Feature activation-0.051
true
Token true
Feature activation-0.104
,
Token,
Feature activation-0.007
though
Token though
Feature activation-0.158
,
Token,
Feature activation+0.030
is
Token is
Feature activation+0.015
that
Token that
Feature activation-0.074
whatever
Token whatever
Feature activation-0.023
policy
Token policy
Feature activation-0.077
prescriptions
Token prescriptions
Feature activation+0.199
that
Token that
Feature activation-0.073
we
Token we
Feature activation+0.083
've
Token've
Feature activation-0.165
been
Token been
Feature activation-0.097
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.010
is
Token is
Feature activation+0.005
that
Token that
Feature activation-0.043
whatever
Token whatever
Feature activation-0.019
policy
Token policy
Feature activation-0.006
prescriptions
Token prescriptions
Feature activation+0.204
that
Token that
Feature activation-0.081
we
Token we
Feature activation+0.056
've
Token've
Feature activation-0.178
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.122
that
Token that
Feature activation-0.128
we
Token we
Feature activation-0.017
've
Token've
Feature activation-0.048
been
Token been
Feature activation-0.001
proposing
Token proposing
Feature activation+0.142
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation-0.024
we
Token we
Feature activation+0.000
've
Token've
Feature activation-0.031
been
Token been
Feature activation-0.003
proposing
Token proposing
Feature activation-0.014
don
Token don
Feature activation+0.183
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.095
perspective
Token perspective
Feature activation+0.056
.
Token.
Feature activation+0.028
"
Token "
Feature activation+0.348
What
TokenWhat
Feature activation+0.110
is
Token is
Feature activation-0.090
true
Token true
Feature activation-0.054
,
Token,
Feature activation+0.010
though
Token though
Feature activation-0.105
<|endoftext|>
Token<|endoftext|>
Feature activation-1.047
perspective
Token perspective
Feature activation+0.029
.
Token.
Feature activation+0.033
"
Token "
Feature activation+0.551
What
TokenWhat
Feature activation+0.046
is
Token is
Feature activation-0.030
true
Token true
Feature activation-0.007
,
Token,
Feature activation-0.017
though
Token though
Feature activation-0.159
perspective
Token perspective
Feature activation+0.165
.
Token.
Feature activation-0.035
"
Token "
Feature activation+0.276
What
TokenWhat
Feature activation+0.181
is
Token is
Feature activation-0.044
true
Token true
Feature activation+0.829
,
Token,
Feature activation-0.012
though
Token though
Feature activation-0.363
,
Token,
Feature activation+0.012
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.010
perspective
Token perspective
Feature activation+0.080
.
Token.
Feature activation-0.086
"
Token "
Feature activation+0.404
What
TokenWhat
Feature activation+0.159
is
Token is
Feature activation+0.025
true
Token true
Feature activation+0.023
,
Token,
Feature activation+0.030
though
Token though
Feature activation-0.246

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.06

Head 3: 0.08

Head 4: 0.08

Head 5: 0.09

Head 6: 0.07

Head 7: 0.08

Head 8: 0.10

Head 9: 0.08

Head 10: 0.09

Head 11: 0.10

Positive logits

tracing 2.94

office 2.71

Discuss 2.52

parent 2.51

descendant 2.51

ordered 2.46

iblings 2.44

cknow 2.43

ordering 2.40

psc 2.40

ilitating 2.34

graph 2.33

Supervisor 2.31

mirrored 2.30

reconciliation 2.29

ibling 2.29

handled 2.27

framed 2.27

tale 2.27

separating 2.27

Negative logits

Salt -2.75

Cra -2.60

cial -2.52

iru -2.47

Vik -2.44

Bol -2.42

Oss -2.42

reserve -2.41

Cos -2.37

Rah -2.35

ubi -2.33

ku -2.30

hom -2.30

ateurs -2.29

Kham -2.27

Ney -2.27

rogens -2.26

iliar -2.25

Hom -2.24

NYC -2.23

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

years
Token years
Feature activation+0.000
old
Token old
Feature activation+0.000
âĢ¦
TokenâĢ¦
Feature activation+0.000
stood
Token stood
Feature activation+0.000
in
Token in
Feature activation+0.000
front
Token front
Feature activation+0.000
of
Token of
Feature activation+0.000
a
Token a
Feature activation+0.000
wall
Token wall
Feature activation+0.000
of
Token of
Feature activation+0.000
Lego
Token Lego
Feature activation+0.000
Iraq
Token Iraq
Feature activation+0.000
,
Token,
Feature activation+0.000
he
Token he
Feature activation+0.000
argues
Token argues
Feature activation+0.000
that
Token that
Feature activation+0.000
Afghanistan
Token Afghanistan
Feature activation+0.000
,
Token,
Feature activation+0.000
where
Token where
Feature activation+0.000
the
Token the
Feature activation+0.000
9
Token 9
Feature activation+0.000
/
Token/
Feature activation+0.000
rights
Token rights
Feature activation+0.000
community
Token community
Feature activation+0.000
has
Token has
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
supported
Tokensupported
Feature activation+0.000
the
Token the
Feature activation+0.000
concept
Token concept
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
Equality
Token Equality
Feature activation+0.000
despite
Token despite
Feature activation+0.000
the
Token the
Feature activation+0.000
fact
Token fact
Feature activation+0.000
that
Token that
Feature activation+0.000
legal
Token legal
Feature activation+0.000
construction
Token construction
Feature activation+0.000
starts
Token starts
Feature activation+0.000
have
Token have
Feature activation+0.000
been
Token been
Feature activation+0.000
prohibited
Token prohibited
Feature activation+0.000
for
Token for
Feature activation+0.000
7
Token 7
Feature activation+0.000
.
Token.
Feature activation+0.000
Mens
Token Mens
Feature activation+0.000
Short
Token Short
Feature activation+0.000
Ha
Token Ha
Feature activation+0.000
irst
Tokenirst
Feature activation+0.000
yles
Tokenyles
Feature activation+0.000
+
Token +
Feature activation+0.000
C
Token C
Feature activation+0.000
rop
Tokenrop
Feature activation+0.000
Cut
Token Cut
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 15: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.918

,
Token,
Feature activation-0.013
is
Token is
Feature activation+0.075
that
Token that
Feature activation+0.010
whatever
Token whatever
Feature activation-0.369
policy
Token policy
Feature activation+0.111
prescriptions
Token prescriptions
Feature activation+0.288
that
Token that
Feature activation+0.119
we
Token we
Feature activation+0.085
've
Token've
Feature activation+0.191
been
Token been
Feature activation-0.074
proposing
Token proposing
Feature activation-0.150
<|endoftext|>
Token<|endoftext|>
Feature activation-5.802
perspective
Token perspective
Feature activation-0.046
.
Token.
Feature activation+0.066
"
Token "
Feature activation+0.309
What
TokenWhat
Feature activation-0.547
is
Token is
Feature activation-0.170
true
Token true
Feature activation-0.429
,
Token,
Feature activation+0.101
though
Token though
Feature activation-0.152
,
Token,
Feature activation-0.082
is
Token is
Feature activation+0.064
that
Token that
Feature activation+0.026
whatever
Token whatever
Feature activation-0.238
policy
Token policy
Feature activation-0.088
prescriptions
Token prescriptions
Feature activation+0.346
that
Token that
Feature activation+0.206
we
Token we
Feature activation+0.115
've
Token've
Feature activation+0.107
been
Token been
Feature activation+0.017
proposing
Token proposing
Feature activation+0.259
true
Token true
Feature activation-0.645
,
Token,
Feature activation+0.031
though
Token though
Feature activation-0.225
,
Token,
Feature activation-0.003
is
Token is
Feature activation+0.132
that
Token that
Feature activation+0.744
whatever
Token whatever
Feature activation-0.269
policy
Token policy
Feature activation-0.334
prescriptions
Token prescriptions
Feature activation+0.136
that
Token that
Feature activation+0.224
we
Token we
Feature activation+0.089
<|endoftext|>
Token<|endoftext|>
Feature activation-7.761
perspective
Token perspective
Feature activation-0.127
.
Token.
Feature activation+0.016
"
Token "
Feature activation+0.541
What
TokenWhat
Feature activation+0.409
is
Token is
Feature activation+0.429
true
Token true
Feature activation-0.178
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.954
perspective
Token perspective
Feature activation-0.103
.
Token.
Feature activation+0.213
"
Token "
Feature activation+1.256
What
TokenWhat
Feature activation+0.415
is
Token is
Feature activation+0.143
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.414
perspective
Token perspective
Feature activation-0.331
.
Token.
Feature activation+0.281
"
Token "
Feature activation+1.571
What
TokenWhat
Feature activation-0.613
is
Token is
Feature activation-0.273
true
Token true
Feature activation-1.100
,
Token,
Feature activation+0.136
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.148
.
Token.
Feature activation-0.025
"
Token "
Feature activation-1.317
What
TokenWhat
Feature activation+0.145
is
Token is
Feature activation-0.078
true
Token true
Feature activation+0.518
,
Token,
Feature activation+0.173
though
Token though
Feature activation-0.747
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation+0.014
is
Token is
Feature activation+0.089
that
Token that
Feature activation+0.036
whatever
Token whatever
Feature activation-0.066
policy
Token policy
Feature activation+0.153
prescriptions
Token prescriptions
Feature activation+1.862
that
Token that
Feature activation+0.072
we
Token we
Feature activation+0.219
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.058
is
Token is
Feature activation+0.187
that
Token that
Feature activation+0.412
whatever
Token whatever
Feature activation-0.115
policy
Token policy
Feature activation-0.330
prescriptions
Token prescriptions
Feature activation+0.523
that
Token that
Feature activation+0.353
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
true
Token true
Feature activation-0.308
,
Token,
Feature activation+0.099
though
Token though
Feature activation-0.198
,
Token,
Feature activation-0.063
is
Token is
Feature activation+0.114
that
Token that
Feature activation+0.520
whatever
Token whatever
Feature activation+0.142
policy
Token policy
Feature activation-0.331
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
true
Token true
Feature activation-0.497
,
Token,
Feature activation+0.013
though
Token though
Feature activation-0.263
,
Token,
Feature activation-0.146
is
Token is
Feature activation+0.121
that
Token that
Feature activation+0.899
whatever
Token whatever
Feature activation-0.148
policy
Token policy
Feature activation-0.089
prescriptions
Token prescriptions
Feature activation+0.461
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
,
Token,
Feature activation+0.002
is
Token is
Feature activation+0.123
that
Token that
Feature activation-0.035
whatever
Token whatever
Feature activation-0.024
policy
Token policy
Feature activation+0.318
prescriptions
Token prescriptions
Feature activation+1.762
that
Token that
Feature activation+0.115
we
Token we
Feature activation+0.309
've
Token've
Feature activation+0.190
been
Token been
Feature activation-0.301
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.034
is
Token is
Feature activation+0.087
that
Token that
Feature activation+0.075
whatever
Token whatever
Feature activation-0.015
policy
Token policy
Feature activation+0.438
prescriptions
Token prescriptions
Feature activation+1.918
that
Token that
Feature activation+0.148
we
Token we
Feature activation+0.370
've
Token've
Feature activation-0.208
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.020
is
Token is
Feature activation-0.059
that
Token that
Feature activation-0.140
whatever
Token whatever
Feature activation-0.215
policy
Token policy
Feature activation+0.543
prescriptions
Token prescriptions
Feature activation+0.931
that
Token that
Feature activation+0.307
we
Token we
Feature activation+0.122
've
Token've
Feature activation+0.120
been
Token been
Feature activation-0.016
proposing
Token proposing
Feature activation+0.359
whatever
Token whatever
Feature activation-0.233
policy
Token policy
Feature activation-0.055
prescriptions
Token prescriptions
Feature activation-0.260
that
Token that
Feature activation-0.041
we
Token we
Feature activation+0.008
've
Token've
Feature activation+0.068
been
Token been
Feature activation-0.019
proposing
Token proposing
Feature activation-0.037
don
Token don
Feature activation-1.970
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.535
,
Token,
Feature activation+0.083
though
Token though
Feature activation-0.187
,
Token,
Feature activation-0.085
is
Token is
Feature activation+0.247
that
Token that
Feature activation+0.626
whatever
Token whatever
Feature activation-0.198
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.925
perspective
Token perspective
Feature activation-0.110
.
Token.
Feature activation+0.126
"
Token "
Feature activation+0.774
What
TokenWhat
Feature activation-0.184
is
Token is
Feature activation+0.015
true
Token true
Feature activation-0.512
,
Token,
Feature activation+0.072
though
Token though
Feature activation-0.276
<|endoftext|>
Token<|endoftext|>
Feature activation-6.662
perspective
Token perspective
Feature activation-0.167
.
Token.
Feature activation+0.047
"
Token "
Feature activation+0.357
What
TokenWhat
Feature activation-0.102
is
Token is
Feature activation-0.113
true
Token true
Feature activation+0.346
,
Token,
Feature activation+0.158
though
Token though
Feature activation-0.781
<|endoftext|>
Token<|endoftext|>
Feature activation-6.778
perspective
Token perspective
Feature activation-0.087
.
Token.
Feature activation+0.026
"
Token "
Feature activation+0.306
What
TokenWhat
Feature activation-0.058
is
Token is
Feature activation+0.081
true
Token true
Feature activation-1.050
,
Token,
Feature activation+0.204
though
Token though
Feature activation-0.446

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.10

Head 2: 0.08

Head 3: 0.08

Head 4: 0.09

Head 5: 0.09

Head 6: 0.09

Head 7: 0.07

Head 8: 0.08

Head 9: 0.07

Head 10: 0.07

Head 11: 0.09

Positive logits

ople 2.88

istors 2.85

OPLE 2.77

upt 2.68

ownt 2.60

ongevity 2.55

achy 2.54

raph 2.42

antic 2.40

poppy 2.40

ossip 2.40

nergy 2.39

undred 2.38

omers 2.37

Staten 2.36

ismo 2.36

transpl 2.35

Sov 2.35

ombies 2.34

cro 2.34

Negative logits

Afgh -2.87

Zub -2.85

Kurdish -2.65

//////////////////////////////// -2.53

Behavioral -2.51

Hussein -2.50

Rosenstein -2.49

ILE -2.48

OTUS -2.43

ahime -2.42

Beh -2.41

ADA -2.41

ASA -2.40

despicable -2.39

DEFENSE -2.39

attacker -2.39

SAS -2.38

SAM -2.38

Nad -2.37

Goldstein -2.37

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

Please
TokenPlease
Feature activation+0.000
enable
Token enable
Feature activation+0.000
Javascript
Token Javascript
Feature activation+0.000
to
Token to
Feature activation+0.000
watch
Token watch
Feature activation+0.000
this
Token this
Feature activation+0.000
video
Token video
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Publisher
TokenPublisher
Feature activation+0.000
The
Token The
Feature activation+0.000
Oz
Token Oz
Feature activation+0.000
my
Token my
Feature activation+0.000
first
Token first
Feature activation+0.000
steps
Token steps
Feature activation+0.000
.
Token.
Feature activation+0.000
That
Token That
Feature activation+0.000
result
Token result
Feature activation+0.000
opened
Token opened
Feature activation+0.000
me
Token me
Feature activation+0.000
a
Token a
Feature activation+0.000
lot
Token lot
Feature activation+0.000
of
Token of
Feature activation+0.000
one
Token one
Feature activation+0.000
talks
Token talks
Feature activation+0.000
about
Token about
Feature activation+0.000
this
Token this
Feature activation+0.000
transformation
Token transformation
Feature activation+0.000
—
Token—
Feature activation+0.000
the
Tokenthe
Feature activation+0.000
creation
Token creation
Feature activation+0.000
of
Token of
Feature activation+0.000
yet
Token yet
Feature activation+0.000
another
Token another
Feature activation+0.000
and
Token and
Feature activation+0.000
class
Token class
Feature activation+0.000
,
Token,
Feature activation+0.000
know
Token know
Feature activation+0.000
the
Token the
Feature activation+0.000
interior
Token interior
Feature activation+0.000
is
Token is
Feature activation+0.000
right
Token right
Feature activation+0.000
on
Token on
Feature activation+0.000
par
Token par
Feature activation+0.000
with
Token with
Feature activation+0.000
hunger
Token hunger
Feature activation+0.000
,
Token,
Feature activation+0.000
enabling
Token enabling
Feature activation+0.000
early
Token early
Feature activation+0.000
interventions
Token interventions
Feature activation+0.000
that
Token that
Feature activation+0.000
could
Token could
Feature activation+0.000
prevent
Token prevent
Feature activation+0.000
serious
Token serious
Feature activation+0.000
health
Token health
Feature activation+0.000
consequences
Token consequences
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 16: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.221

whatever
Token whatever
Feature activation-0.599
policy
Token policy
Feature activation-0.083
prescriptions
Token prescriptions
Feature activation-0.200
that
Token that
Feature activation-0.225
we
Token we
Feature activation-0.072
've
Token've
Feature activation+0.090
been
Token been
Feature activation-0.251
proposing
Token proposing
Feature activation-0.115
don
Token don
Feature activation-0.165
't
Token't
Feature activation-0.516
reach
Token reach
Feature activation-2.768
<|endoftext|>
Token<|endoftext|>
Feature activation-6.504
perspective
Token perspective
Feature activation-0.041
.
Token.
Feature activation-0.048
"
Token "
Feature activation+0.182
What
TokenWhat
Feature activation-0.697
is
Token is
Feature activation-0.267
true
Token true
Feature activation-0.423
,
Token,
Feature activation-0.067
though
Token though
Feature activation-0.166
prescriptions
Token prescriptions
Feature activation-0.079
that
Token that
Feature activation-0.086
we
Token we
Feature activation-0.085
've
Token've
Feature activation-0.027
been
Token been
Feature activation-0.131
proposing
Token proposing
Feature activation+0.621
don
Token don
Feature activation-1.382
't
Token't
Feature activation-0.653
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.709
,
Token,
Feature activation-0.161
though
Token though
Feature activation-0.313
,
Token,
Feature activation-0.154
is
Token is
Feature activation-0.012
that
Token that
Feature activation+0.396
whatever
Token whatever
Feature activation-0.451
policy
Token policy
Feature activation-0.506
prescriptions
Token prescriptions
Feature activation-0.391
that
Token that
Feature activation-0.046
we
Token we
Feature activation-0.005
<|endoftext|>
Token<|endoftext|>
Feature activation-8.650
perspective
Token perspective
Feature activation+0.047
.
Token.
Feature activation-0.333
"
Token "
Feature activation+0.255
What
TokenWhat
Feature activation-0.805
is
Token is
Feature activation+0.018
true
Token true
Feature activation-0.409
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.893
perspective
Token perspective
Feature activation-0.058
.
Token.
Feature activation-0.311
"
Token "
Feature activation+0.743
What
TokenWhat
Feature activation-0.694
is
Token is
Feature activation-0.328
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.290
perspective
Token perspective
Feature activation-0.227
.
Token.
Feature activation-0.053
"
Token "
Feature activation+1.221
What
TokenWhat
Feature activation-1.292
is
Token is
Feature activation-0.572
true
Token true
Feature activation-1.110
,
Token,
Feature activation-0.380
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.094
.
Token.
Feature activation-0.009
"
Token "
Feature activation-1.051
What
TokenWhat
Feature activation-0.842
is
Token is
Feature activation-0.299
true
Token true
Feature activation+0.700
,
Token,
Feature activation-0.199
though
Token though
Feature activation-0.947
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.145
is
Token is
Feature activation-0.115
that
Token that
Feature activation-0.500
whatever
Token whatever
Feature activation-0.232
policy
Token policy
Feature activation-0.366
prescriptions
Token prescriptions
Feature activation+0.369
that
Token that
Feature activation-0.437
we
Token we
Feature activation-0.140
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.731
perspective
Token perspective
Feature activation-0.060
.
Token.
Feature activation-0.117
"
Token "
Feature activation+0.155
What
TokenWhat
Feature activation-0.191
is
Token is
Feature activation-0.125
true
Token true
Feature activation-0.330
,
Token,
Feature activation-0.133
though
Token though
Feature activation-0.268
<|endoftext|>
Token<|endoftext|>
Feature activation-8.286
perspective
Token perspective
Feature activation-0.077
.
Token.
Feature activation-0.207
"
Token "
Feature activation+0.025
What
TokenWhat
Feature activation-0.080
is
Token is
Feature activation-0.363
true
Token true
Feature activation-0.433
,
Token,
Feature activation-0.112
though
Token though
Feature activation-0.257
<|endoftext|>
Token<|endoftext|>
Feature activation-6.863
perspective
Token perspective
Feature activation-0.064
.
Token.
Feature activation-0.244
"
Token "
Feature activation+0.078
What
TokenWhat
Feature activation-0.198
is
Token is
Feature activation-0.245
true
Token true
Feature activation-0.556
,
Token,
Feature activation-0.200
though
Token though
Feature activation-0.382
,
Token,
Feature activation-0.152
is
Token is
Feature activation-0.011
that
Token that
Feature activation-0.389
whatever
Token whatever
Feature activation-0.226
policy
Token policy
Feature activation-0.273
prescriptions
Token prescriptions
Feature activation+0.733
that
Token that
Feature activation-0.290
we
Token we
Feature activation-0.193
've
Token've
Feature activation-0.240
been
Token been
Feature activation-0.755
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.166
is
Token is
Feature activation-0.067
that
Token that
Feature activation-0.449
whatever
Token whatever
Feature activation-0.242
policy
Token policy
Feature activation-0.164
prescriptions
Token prescriptions
Feature activation+0.813
that
Token that
Feature activation-0.395
we
Token we
Feature activation+0.020
've
Token've
Feature activation-0.543
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.310
that
Token that
Feature activation-0.313
we
Token we
Feature activation-0.139
've
Token've
Feature activation-0.023
been
Token been
Feature activation-0.175
proposing
Token proposing
Feature activation+0.969
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.281
policy
Token policy
Feature activation-0.067
prescriptions
Token prescriptions
Feature activation-0.313
that
Token that
Feature activation-0.081
we
Token we
Feature activation-0.007
've
Token've
Feature activation+0.086
been
Token been
Feature activation-0.041
proposing
Token proposing
Feature activation-0.044
don
Token don
Feature activation-10.123
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.687
perspective
Token perspective
Feature activation+0.018
.
Token.
Feature activation-0.141
"
Token "
Feature activation+0.163
What
TokenWhat
Feature activation-0.362
is
Token is
Feature activation-0.317
true
Token true
Feature activation-0.678
,
Token,
Feature activation-0.168
though
Token though
Feature activation-0.211
<|endoftext|>
Token<|endoftext|>
Feature activation-7.720
perspective
Token perspective
Feature activation+0.021
.
Token.
Feature activation-0.127
"
Token "
Feature activation+0.539
What
TokenWhat
Feature activation-0.492
is
Token is
Feature activation-0.189
true
Token true
Feature activation-0.433
,
Token,
Feature activation-0.263
though
Token though
Feature activation-0.340
perspective
Token perspective
Feature activation+0.130
.
Token.
Feature activation-0.174
"
Token "
Feature activation+0.170
What
TokenWhat
Feature activation-1.068
is
Token is
Feature activation-0.311
true
Token true
Feature activation+0.281
,
Token,
Feature activation-0.182
though
Token though
Feature activation-0.921
,
Token,
Feature activation-0.775
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.529
perspective
Token perspective
Feature activation+0.026
.
Token.
Feature activation-0.144
"
Token "
Feature activation+0.223
What
TokenWhat
Feature activation-0.591
is
Token is
Feature activation-0.147
true
Token true
Feature activation-1.024
,
Token,
Feature activation-0.192
though
Token though
Feature activation-0.580

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.09

Head 2: 0.07

Head 3: 0.08

Head 4: 0.08

Head 5: 0.07

Head 6: 0.09

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.10

Head 11: 0.09

Positive logits

batter 3.28

haw 3.08

rehensive 2.94

ollow 2.93

Totem 2.91

odon 2.90

NM 2.87

heric 2.75

hedral 2.67

ulhu 2.63

ula 2.63

apego 2.63

oint 2.62

hair 2.62

NZ 2.61

natureconservancy 2.61

MpServer 2.61

wreck 2.60

rarily 2.60

imus 2.58

Negative logits

ック -3.15

Freedom -2.80

Jess -2.73

Frie -2.67

Bav -2.61

Frieza -2.60

Vers -2.55

Len -2.55

Dana -2.53

Ju -2.47

Lup -2.47

Chelsea -2.45

Rite -2.45

Investigative -2.43

filler -2.41

Leonardo -2.40

Bran -2.37

Greenwald -2.36

Niger -2.36

Universal -2.36

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

discovered
Token discovered
Feature activation+0.000
how
Token how
Feature activation+0.000
to
Token to
Feature activation+0.000
control
Token control
Feature activation+0.000
heat
Token heat
Feature activation+0.000
with
Token with
Feature activation+0.000
a
Token a
Feature activation+0.000
magnetic
Token magnetic
Feature activation+0.000
field
Token field
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
be
Token be
Feature activation+0.000
the
Token the
Feature activation+0.000
life
Token life
Feature activation+0.000
,
Token,
Feature activation+0.000
right
Token right
Feature activation+0.000
?
Token?
Feature activation+0.000
So
Token So
Feature activation+0.000
jealous
Token jealous
Feature activation+0.000
!
Token!
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
).
Token).
Feature activation+0.000
In
Token In
Feature activation+0.000
both
Token both
Feature activation+0.000
cases
Token cases
Feature activation+0.000
,
Token,
Feature activation+0.000
"
Token "
Feature activation+0.000
tin
Tokentin
Feature activation+0.000
whisk
Token whisk
Feature activation+0.000
ers
Tokeners
Feature activation+0.000
"
Token"
Feature activation+0.000
-
Token -
Feature activation+0.000
over
Token over
Feature activation+0.000
the
Token the
Feature activation+0.000
years
Token years
Feature activation+0.000
.
Token.
Feature activation+0.000
He
Token He
Feature activation+0.000
has
Token has
Feature activation+0.000
recently
Token recently
Feature activation+0.000
helped
Token helped
Feature activation+0.000
hockey
Token hockey
Feature activation+0.000
star
Token star
Feature activation+0.000
Sydney
Token Sydney
Feature activation+0.000
District
Token District
Feature activation+0.000
,
Token,
Feature activation+0.000
where
Token where
Feature activation+0.000
they
Token they
Feature activation+0.000
had
Token had
Feature activation+0.000
acquired
Token acquired
Feature activation+0.000
24
Token 24
Feature activation+0.000
acres
Token acres
Feature activation+0.000
of
Token of
Feature activation+0.000
parking
Token parking
Feature activation+0.000
lots
Token lots
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 17: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.912

<|endoftext|>
Token<|endoftext|>
Feature activation-4.334
perspective
Token perspective
Feature activation-0.095
.
Token.
Feature activation-0.167
"
Token "
Feature activation+0.174
What
TokenWhat
Feature activation-0.185
is
Token is
Feature activation-0.209
true
Token true
Feature activation-0.840
,
Token,
Feature activation+0.020
though
Token though
Feature activation-0.104
<|endoftext|>
Token<|endoftext|>
Feature activation-4.183
perspective
Token perspective
Feature activation-0.088
.
Token.
Feature activation-0.055
"
Token "
Feature activation+0.334
What
TokenWhat
Feature activation-0.333
is
Token is
Feature activation-0.159
true
Token true
Feature activation-0.430
,
Token,
Feature activation+0.084
though
Token though
Feature activation-0.211
,
Token,
Feature activation-0.102
is
Token is
Feature activation-0.026
that
Token that
Feature activation-0.155
whatever
Token whatever
Feature activation-0.165
policy
Token policy
Feature activation-0.002
prescriptions
Token prescriptions
Feature activation+0.326
that
Token that
Feature activation+0.123
we
Token we
Feature activation+0.042
've
Token've
Feature activation+0.030
been
Token been
Feature activation-0.120
proposing
Token proposing
Feature activation+0.323
true
Token true
Feature activation-0.578
,
Token,
Feature activation-0.085
though
Token though
Feature activation-0.387
,
Token,
Feature activation-0.048
is
Token is
Feature activation+0.027
that
Token that
Feature activation+0.475
whatever
Token whatever
Feature activation-0.180
policy
Token policy
Feature activation-0.235
prescriptions
Token prescriptions
Feature activation-0.010
that
Token that
Feature activation+0.104
we
Token we
Feature activation-0.017
<|endoftext|>
Token<|endoftext|>
Feature activation-5.610
perspective
Token perspective
Feature activation-0.202
.
Token.
Feature activation-0.521
"
Token "
Feature activation+0.460
What
TokenWhat
Feature activation+0.656
is
Token is
Feature activation-0.000
true
Token true
Feature activation-0.253
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.719
perspective
Token perspective
Feature activation-0.188
.
Token.
Feature activation-0.336
"
Token "
Feature activation+1.495
What
TokenWhat
Feature activation+0.592
is
Token is
Feature activation-0.389
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.628
perspective
Token perspective
Feature activation-0.413
.
Token.
Feature activation-0.139
"
Token "
Feature activation+1.912
What
TokenWhat
Feature activation-0.206
is
Token is
Feature activation-0.377
true
Token true
Feature activation-0.946
,
Token,
Feature activation+0.010
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.214
.
Token.
Feature activation-0.667
"
Token "
Feature activation-1.476
What
TokenWhat
Feature activation+0.184
is
Token is
Feature activation-0.284
true
Token true
Feature activation+1.124
,
Token,
Feature activation-0.002
though
Token though
Feature activation-1.395
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation+0.030
is
Token is
Feature activation-0.047
that
Token that
Feature activation-0.210
whatever
Token whatever
Feature activation-0.036
policy
Token policy
Feature activation-0.292
prescriptions
Token prescriptions
Feature activation+0.505
that
Token that
Feature activation-0.161
we
Token we
Feature activation-0.243
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.101
perspective
Token perspective
Feature activation-0.079
.
Token.
Feature activation-0.191
"
Token "
Feature activation+0.317
What
TokenWhat
Feature activation+0.230
is
Token is
Feature activation-0.059
true
Token true
Feature activation-0.255
,
Token,
Feature activation-0.010
though
Token though
Feature activation-0.312
<|endoftext|>
Token<|endoftext|>
Feature activation-5.377
perspective
Token perspective
Feature activation-0.109
.
Token.
Feature activation-0.218
"
Token "
Feature activation+0.166
What
TokenWhat
Feature activation+0.603
is
Token is
Feature activation-0.270
true
Token true
Feature activation-0.367
,
Token,
Feature activation+0.009
though
Token though
Feature activation-0.311
,
Token,
Feature activation-0.107
<|endoftext|>
Token<|endoftext|>
Feature activation-4.563
perspective
Token perspective
Feature activation-0.071
.
Token.
Feature activation-0.350
"
Token "
Feature activation+0.120
What
TokenWhat
Feature activation+0.447
is
Token is
Feature activation-0.184
true
Token true
Feature activation-0.506
,
Token,
Feature activation-0.040
though
Token though
Feature activation-0.443
,
Token,
Feature activation-0.172
,
Token,
Feature activation-0.024
is
Token is
Feature activation+0.036
that
Token that
Feature activation-0.195
whatever
Token whatever
Feature activation-0.028
policy
Token policy
Feature activation-0.294
prescriptions
Token prescriptions
Feature activation+0.691
that
Token that
Feature activation-0.062
we
Token we
Feature activation-0.066
've
Token've
Feature activation-0.149
been
Token been
Feature activation-0.416
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.065
is
Token is
Feature activation-0.010
that
Token that
Feature activation-0.164
whatever
Token whatever
Feature activation-0.026
policy
Token policy
Feature activation-0.146
prescriptions
Token prescriptions
Feature activation+0.776
that
Token that
Feature activation-0.112
we
Token we
Feature activation-0.094
've
Token've
Feature activation-0.381
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.043
is
Token is
Feature activation-0.131
that
Token that
Feature activation-0.455
whatever
Token whatever
Feature activation-0.269
policy
Token policy
Feature activation+0.055
prescriptions
Token prescriptions
Feature activation+0.682
that
Token that
Feature activation+0.081
we
Token we
Feature activation-0.093
've
Token've
Feature activation+0.010
been
Token been
Feature activation-0.106
proposing
Token proposing
Feature activation+0.542
whatever
Token whatever
Feature activation-0.173
policy
Token policy
Feature activation-0.058
prescriptions
Token prescriptions
Feature activation-0.223
that
Token that
Feature activation-0.046
we
Token we
Feature activation-0.013
've
Token've
Feature activation+0.053
been
Token been
Feature activation-0.033
proposing
Token proposing
Feature activation-0.031
don
Token don
Feature activation-1.570
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.010
perspective
Token perspective
Feature activation-0.043
.
Token.
Feature activation-0.193
"
Token "
Feature activation+0.439
What
TokenWhat
Feature activation+0.062
is
Token is
Feature activation-0.247
true
Token true
Feature activation-0.573
,
Token,
Feature activation+0.011
though
Token though
Feature activation-0.280
<|endoftext|>
Token<|endoftext|>
Feature activation-5.014
perspective
Token perspective
Feature activation-0.117
.
Token.
Feature activation-0.209
"
Token "
Feature activation+0.906
What
TokenWhat
Feature activation-0.144
is
Token is
Feature activation-0.224
true
Token true
Feature activation-0.396
,
Token,
Feature activation-0.073
though
Token though
Feature activation-0.439
perspective
Token perspective
Feature activation-0.270
.
Token.
Feature activation-0.416
"
Token "
Feature activation+0.251
What
TokenWhat
Feature activation+0.123
is
Token is
Feature activation-0.115
true
Token true
Feature activation+0.826
,
Token,
Feature activation+0.008
though
Token though
Feature activation-1.128
,
Token,
Feature activation-0.206
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.877
perspective
Token perspective
Feature activation-0.105
.
Token.
Feature activation-0.501
"
Token "
Feature activation+0.327
What
TokenWhat
Feature activation+0.196
is
Token is
Feature activation-0.110
true
Token true
Feature activation-0.712
,
Token,
Feature activation+0.082
though
Token though
Feature activation-0.742

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.08

Head 2: 0.09

Head 3: 0.08

Head 4: 0.08

Head 5: 0.08

Head 6: 0.09

Head 7: 0.09

Head 8: 0.09

Head 9: 0.07

Head 10: 0.08

Head 11: 0.08

Positive logits

EC: 2.97

EM: 2.89

Braz: 2.88

selves: 2.79

External: 2.71

RH: 2.68

Elect: 2.68

RH: 2.68

Leeds: 2.64

osit: 2.63

transplant: 2.62

carry: 2.60

HIP: 2.60

"]=>: 2.59

Symb: 2.55

capac: 2.55

Bucc: 2.54

Lex: 2.54

EF: 2.53

Wilhelm: 2.51

Negative logits

Redditor: -3.28

Skinner: -3.11

Rohingya: -2.87

atari: -2.85

isal: -2.82

pitted: -2.79

uum: -2.79

uto: -2.77

bamboo: -2.75

Utah: -2.70

Gameplay: -2.67

ende: -2.62

ioxide: -2.61

Shade: -2.61

atu: -2.60

scrolls: -2.59

ć: -2.59

aez: -2.54

noodles: -2.53

caste: -2.52

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

would
Token would
Feature activation+0.000
like
Token like
Feature activation+0.000
to
Token to
Feature activation+0.000
believe
Token believe
Feature activation+0.000
the
Token the
Feature activation+0.000
university
Token university
Feature activation+0.000
doesn
Token doesn
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
t
Tokent
Feature activation+0.000
want
Token want
Feature activation+0.000
are
Token are
Feature activation+0.000
safe
Token safe
Feature activation+0.000
on
Token on
Feature activation+0.000
site
Token site
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
HOW
TokenHOW
Feature activation+0.000
FREE
Token FREE
Feature activation+0.000
ARE
Token ARE
Feature activation+0.000
FREE
Token FREE
Feature activation+0.000
that
Token that
Feature activation+0.000
he
Token he
Feature activation+0.000
would
Token would
Feature activation+0.000
probably
Token probably
Feature activation+0.000
violate
Token violate
Feature activation+0.000
his
Token his
Feature activation+0.000
gag
Token gag
Feature activation+0.000
order
Token order
Feature activation+0.000
if
Token if
Feature activation+0.000
he
Token he
Feature activation+0.000
talked
Token talked
Feature activation+0.000
international
Token international
Feature activation+0.000
conference
Token conference
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
People
TokenPeople
Feature activation+0.000
over
Token over
Feature activation+0.000
45
Token 45
Feature activation+0.000
basically
Token basically
Feature activation+0.000
die
Token die
Feature activation+0.000
in
Token in
Feature activation+0.000
terms
Token terms
Feature activation+0.000
excellent
Token excellent
Feature activation+0.000
time
Token time
Feature activation+0.000
to
Token to
Feature activation+0.000
engage
Token engage
Feature activation+0.000
in
Token in
Feature activation+0.000
some
Token some
Feature activation+0.000
bald
Token bald
Feature activation+0.000
,
Token,
Feature activation+0.000
obvious
Token obvious
Feature activation+0.000
polit
Token polit
Feature activation+0.000
icking
Tokenicking
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 18: Ultra low frequency cluster

TOP ACTIVATIONS
MAX = 1.477

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+1.477
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 12.085

the
Token the
Feature activation-0.016
team
Token team
Feature activation-0.012
has
Token has
Feature activation-0.011
signed
Token signed
Feature activation-0.007
forward
Token forward
Feature activation+0.036
Tyson
Token Tyson
Feature activation+12.085
J
Token J
Feature activation-0.093
ost
Tokenost
Feature activation-0.115
(
Token (
Feature activation-0.014
J
TokenJ
Feature activation-0.051
OH
TokenOH
Feature activation-0.030
<|endoftext|>
Token<|endoftext|>
Feature activation-3.497
perspective
Token perspective
Feature activation-0.053
.
Token.
Feature activation+0.019
"
Token "
Feature activation+0.136
What
TokenWhat
Feature activation-0.325
is
Token is
Feature activation-0.077
true
Token true
Feature activation-0.322
,
Token,
Feature activation+0.001
though
Token though
Feature activation-0.198
whatever
Token whatever
Feature activation-0.166
policy
Token policy
Feature activation-0.061
prescriptions
Token prescriptions
Feature activation-0.185
that
Token that
Feature activation-0.048
we
Token we
Feature activation-0.009
've
Token've
Feature activation+0.040
been
Token been
Feature activation-0.021
proposing
Token proposing
Feature activation-0.016
don
Token don
Feature activation-3.864
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.399
,
Token,
Feature activation+0.020
though
Token though
Feature activation-0.304
,
Token,
Feature activation-0.071
is
Token is
Feature activation-0.046
that
Token that
Feature activation+0.130
whatever
Token whatever
Feature activation-0.323
policy
Token policy
Feature activation-0.533
prescriptions
Token prescriptions
Feature activation-0.278
that
Token that
Feature activation-0.149
we
Token we
Feature activation-0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.761
perspective
Token perspective
Feature activation-0.230
.
Token.
Feature activation-0.210
"
Token "
Feature activation+0.193
What
TokenWhat
Feature activation-0.019
is
Token is
Feature activation+0.461
true
Token true
Feature activation-0.291
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.900
perspective
Token perspective
Feature activation-0.238
.
Token.
Feature activation-0.142
"
Token "
Feature activation+0.660
What
TokenWhat
Feature activation+0.001
is
Token is
Feature activation+0.183
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-3.932
perspective
Token perspective
Feature activation-0.363
.
Token.
Feature activation+0.074
"
Token "
Feature activation+0.879
What
TokenWhat
Feature activation-0.431
is
Token is
Feature activation-0.076
true
Token true
Feature activation-0.754
,
Token,
Feature activation+0.031
though
Token though
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.085
that
Token that
Feature activation-0.097
we
Token we
Feature activation-0.067
've
Token've
Feature activation-0.023
been
Token been
Feature activation+0.014
proposing
Token proposing
Feature activation+0.218
don
Token don
Feature activation-0.560
't
Token't
Feature activation-0.279
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
,
Token,
Feature activation-0.105
is
Token is
Feature activation-0.078
that
Token that
Feature activation-0.380
whatever
Token whatever
Feature activation-0.282
policy
Token policy
Feature activation-0.005
prescriptions
Token prescriptions
Feature activation+1.244
that
Token that
Feature activation-0.433
we
Token we
Feature activation-0.397
've
Token've
Feature activation-0.340
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-3.774
perspective
Token perspective
Feature activation-0.049
.
Token.
Feature activation+0.020
"
Token "
Feature activation+0.146
What
TokenWhat
Feature activation+0.138
is
Token is
Feature activation-0.008
true
Token true
Feature activation-0.405
,
Token,
Feature activation+0.051
though
Token though
Feature activation-0.353
<|endoftext|>
Token<|endoftext|>
Feature activation-4.204
perspective
Token perspective
Feature activation-0.039
.
Token.
Feature activation+0.004
"
Token "
Feature activation+0.209
What
TokenWhat
Feature activation-0.046
is
Token is
Feature activation-0.034
true
Token true
Feature activation-0.462
,
Token,
Feature activation+0.029
though
Token though
Feature activation-0.239
<|endoftext|>
Token<|endoftext|>
Feature activation-4.555
perspective
Token perspective
Feature activation-0.060
.
Token.
Feature activation-0.030
"
Token "
Feature activation+0.061
What
TokenWhat
Feature activation+0.223
is
Token is
Feature activation-0.126
true
Token true
Feature activation-0.287
,
Token,
Feature activation+0.008
though
Token though
Feature activation-0.261
,
Token,
Feature activation-0.087
,
Token,
Feature activation-0.117
is
Token is
Feature activation-0.054
that
Token that
Feature activation-0.284
whatever
Token whatever
Feature activation-0.260
policy
Token policy
Feature activation-0.032
prescriptions
Token prescriptions
Feature activation+1.118
that
Token that
Feature activation-0.304
we
Token we
Feature activation-0.327
've
Token've
Feature activation-0.134
been
Token been
Feature activation-0.373
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.145
is
Token is
Feature activation-0.116
that
Token that
Feature activation-0.413
whatever
Token whatever
Feature activation-0.251
policy
Token policy
Feature activation-0.180
prescriptions
Token prescriptions
Feature activation+0.881
that
Token that
Feature activation-0.406
we
Token we
Feature activation-0.376
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.124
is
Token is
Feature activation-0.139
that
Token that
Feature activation-0.564
whatever
Token whatever
Feature activation-0.395
policy
Token policy
Feature activation+0.282
prescriptions
Token prescriptions
Feature activation+0.516
that
Token that
Feature activation-0.131
we
Token we
Feature activation-0.140
've
Token've
Feature activation-0.020
been
Token been
Feature activation-0.035
proposing
Token proposing
Feature activation+0.433
perspective
Token perspective
Feature activation-0.412
.
Token.
Feature activation-0.054
"
Token "
Feature activation+0.073
What
TokenWhat
Feature activation-0.217
is
Token is
Feature activation+0.060
true
Token true
Feature activation+0.459
,
Token,
Feature activation+0.038
though
Token though
Feature activation-0.905
,
Token,
Feature activation-0.228
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.252
perspective
Token perspective
Feature activation-0.048
.
Token.
Feature activation+0.009
"
Token "
Feature activation+0.158
What
TokenWhat
Feature activation+0.038
is
Token is
Feature activation+0.025
true
Token true
Feature activation-0.209
,
Token,
Feature activation+0.017
though
Token though
Feature activation-0.253
<|endoftext|>
Token<|endoftext|>
Feature activation-4.213
perspective
Token perspective
Feature activation-0.079
.
Token.
Feature activation+0.001
"
Token "
Feature activation+0.413
What
TokenWhat
Feature activation-0.184
is
Token is
Feature activation+0.026
true
Token true
Feature activation-0.316
,
Token,
Feature activation+0.014
though
Token though
Feature activation-0.348
perspective
Token perspective
Feature activation-0.196
.
Token.
Feature activation+0.042
"
Token "
Feature activation-1.459
What
TokenWhat
Feature activation-0.144
is
Token is
Feature activation+0.021
true
Token true
Feature activation+0.315
,
Token,
Feature activation+0.095
though
Token though
Feature activation-1.220
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.090
perspective
Token perspective
Feature activation-0.157
.
Token.
Feature activation-0.105
"
Token "
Feature activation+0.059
What
TokenWhat
Feature activation-0.070
is
Token is
Feature activation+0.169
true
Token true
Feature activation-0.562
,
Token,
Feature activation+0.046
though
Token though
Feature activation-0.598
,
Token,
Feature activation-0.178
is
Token is
Feature activation-0.225

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.08

Head 3: 0.09

Head 4: 0.08

Head 5: 0.09

Head 6: 0.07

Head 7: 0.08

Head 8: 0.09

Head 9: 0.08

Head 10: 0.08

Head 11: 0.09

Positive logits

Kendrick 2.83

Scand 2.82

Laur 2.82

Trudeau 2.78

Zucker 2.72

Saud 2.65

Anderson 2.62

OIL 2.62

Kid 2.61

Raptors 2.55

DJ 2.55

prof 2.54

ommel 2.53

1070 2.53

Ubisoft 2.52

Vaughn 2.52

Morrison 2.50

ODY 2.50

Trance 2.48

?) 2.47

Negative logits

龍� -3.07

shell -3.04

Indigo -2.95

acles -2.81

Gob -2.72

-2.59

Offline -2.58

atorial -2.56

umph -2.54

romeda -2.52

adop -2.51

Ele -2.50

natureconservancy -2.50

typh -2.48

QR -2.48

CrossRef -2.46

square -2.45

alpha -2.45

acle -2.43

PowerShell -2.40

INTERVAL 1.329 - 1.477
CONTAINS 0.000%

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+1.477
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000

INTERVAL 1.181 - 1.329
CONTAINS 0.000%

INTERVAL 1.034 - 1.181
CONTAINS 0.000%

INTERVAL 0.886 - 1.034
CONTAINS 0.000%

INTERVAL 0.738 - 0.886
CONTAINS 0.000%

INTERVAL 0.591 - 0.738
CONTAINS 0.000%

INTERVAL 0.443 - 0.591
CONTAINS 0.000%

INTERVAL 0.295 - 0.443
CONTAINS 0.000%

INTERVAL 0.148 - 0.295
CONTAINS 0.000%

INTERVAL 0.000 - 0.148
CONTAINS 100.000%

and
Token and
Feature activation+0.000
7
Token 7
Feature activation+0.000
assists
Token assists
Feature activation+0.000
in
Token in
Feature activation+0.000
the
Token the
Feature activation+0.000
win
Token win
Feature activation+0.000
.
Token.
Feature activation+0.000
Arthur
Token Arthur
Feature activation+0.000
Edwards
Token Edwards
Feature activation+0.000
led
Token led
Feature activation+0.000
the
Token the
Feature activation+0.000
a
Token a
Feature activation+0.000
much
Token much
Feature activation+0.000
bigger
Token bigger
Feature activation+0.000
positive
Token positive
Feature activation+0.000
economic
Token economic
Feature activation+0.000
impact
Token impact
Feature activation+0.000
than
Token than
Feature activation+0.000
eliminating
Token eliminating
Feature activation+0.000
land
Token land
Feature activation+0.000
use
Token use
Feature activation+0.000
restrictions
Token restrictions
Feature activation+0.000
people
Token people
Feature activation+0.000
died
Token died
Feature activation+0.000
in
Tokenin
Feature activation+0.000
the
Token the
Feature activation+0.000
attacks
Token attacks
Feature activation+0.000
,
Token,
Feature activation+0.000
while
Token while
Feature activation+0.000
another
Token another
Feature activation+0.000
three
Token three
Feature activation+0.000
were
Token were
Feature activation+0.000
killed
Token killed
Feature activation+0.000
igl
Tokenigl
Feature activation+0.000
io
Tokenio
Feature activation+0.000
Sports
TokenSports
Feature activation+0.000
.
Token.
Feature activation+0.000
Find
Token Find
Feature activation+0.000
NJ
Token NJ
Feature activation+0.000
.
Token.
Feature activation+0.000
com
Tokencom
Feature activation+0.000
on
Token on
Feature activation+0.000
Facebook
Token Facebook
Feature activation+0.000
.
Token.
Feature activation+0.000
Lake
Token Lake
Feature activation+0.000
City
Token City
Feature activation+0.000
,
Token,
Feature activation+0.000
UT
Token UT
Feature activation+0.000
–
Token –
Feature activation+0.000
US
Token US
Feature activation+0.000
ANA
TokenANA
Feature activation+0.000
Amph
Token Amph
Feature activation+0.000
ithe
Tokenithe
Feature activation+0.000
atre
Tokenatre
Feature activation+0.000
+++
Token+++
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 19: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.254

been
Token been
Feature activation-0.199
proposing
Token proposing
Feature activation-0.151
don
Token don
Feature activation-0.187
't
Token't
Feature activation-0.558
reach
Token reach
Feature activation-2.987
,
Token,
Feature activation+0.119
are
Token are
Feature activation-0.561
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
been
Token been
Feature activation-0.170
proposing
Token proposing
Feature activation-0.186
don
Token don
Feature activation-0.320
't
Token't
Feature activation-0.371
reach
Token reach
Feature activation-1.161
,
Token,
Feature activation+0.226
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.019
that
Token that
Feature activation-0.125
we
Token we
Feature activation-0.007
've
Token've
Feature activation+0.092
been
Token been
Feature activation-0.036
proposing
Token proposing
Feature activation+0.560
don
Token don
Feature activation-1.053
't
Token't
Feature activation-0.639
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.698
,
Token,
Feature activation-0.309
though
Token though
Feature activation-0.611
,
Token,
Feature activation-0.199
is
Token is
Feature activation-0.039
that
Token that
Feature activation+0.264
whatever
Token whatever
Feature activation-0.415
policy
Token policy
Feature activation-0.572
prescriptions
Token prescriptions
Feature activation-0.267
that
Token that
Feature activation-0.107
we
Token we
Feature activation-0.140
<|endoftext|>
Token<|endoftext|>
Feature activation-8.506
perspective
Token perspective
Feature activation-0.203
.
Token.
Feature activation-0.574
"
Token "
Feature activation-0.034
What
TokenWhat
Feature activation+0.160
is
Token is
Feature activation-0.190
true
Token true
Feature activation-0.466
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.735
perspective
Token perspective
Feature activation-0.314
.
Token.
Feature activation-0.457
"
Token "
Feature activation+0.654
What
TokenWhat
Feature activation+0.138
is
Token is
Feature activation-0.604
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.254
perspective
Token perspective
Feature activation-0.590
.
Token.
Feature activation-0.188
"
Token "
Feature activation+1.254
What
TokenWhat
Feature activation-0.807
is
Token is
Feature activation-0.634
true
Token true
Feature activation-1.150
,
Token,
Feature activation-0.692
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.267
.
Token.
Feature activation-0.262
"
Token "
Feature activation-0.810
What
TokenWhat
Feature activation-0.273
is
Token is
Feature activation-0.283
true
Token true
Feature activation+0.238
,
Token,
Feature activation-0.511
though
Token though
Feature activation-2.525
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.614
perspective
Token perspective
Feature activation-0.018
.
Token.
Feature activation-0.055
"
Token "
Feature activation+0.119
What
TokenWhat
Feature activation-0.043
is
Token is
Feature activation-0.285
true
Token true
Feature activation-0.168
,
Token,
Feature activation-0.143
though
Token though
Feature activation-0.210
<|endoftext|>
Token<|endoftext|>
Feature activation-7.677
perspective
Token perspective
Feature activation-0.098
.
Token.
Feature activation-0.165
"
Token "
Feature activation-0.033
What
TokenWhat
Feature activation+0.096
is
Token is
Feature activation-0.173
true
Token true
Feature activation-0.390
,
Token,
Feature activation-0.309
though
Token though
Feature activation-0.503
,
Token,
Feature activation-0.237
<|endoftext|>
Token<|endoftext|>
Feature activation-8.152
perspective
Token perspective
Feature activation-0.141
.
Token.
Feature activation-0.204
"
Token "
Feature activation-0.206
What
TokenWhat
Feature activation+0.520
is
Token is
Feature activation-0.446
true
Token true
Feature activation-0.471
,
Token,
Feature activation-0.258
though
Token though
Feature activation-0.504
,
Token,
Feature activation-0.244
<|endoftext|>
Token<|endoftext|>
Feature activation-6.821
perspective
Token perspective
Feature activation-0.106
.
Token.
Feature activation-0.286
"
Token "
Feature activation-0.337
What
TokenWhat
Feature activation+0.256
is
Token is
Feature activation-0.354
true
Token true
Feature activation-0.630
,
Token,
Feature activation-0.377
though
Token though
Feature activation-0.693
,
Token,
Feature activation-0.340
,
Token,
Feature activation-0.227
is
Token is
Feature activation-0.056
that
Token that
Feature activation-0.645
whatever
Token whatever
Feature activation-0.326
policy
Token policy
Feature activation-0.515
prescriptions
Token prescriptions
Feature activation+0.041
that
Token that
Feature activation-0.498
we
Token we
Feature activation-0.258
've
Token've
Feature activation-0.104
been
Token been
Feature activation-0.479
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.229
is
Token is
Feature activation-0.102
that
Token that
Feature activation-0.766
whatever
Token whatever
Feature activation-0.343
policy
Token policy
Feature activation-0.295
prescriptions
Token prescriptions
Feature activation+0.161
that
Token that
Feature activation-0.690
we
Token we
Feature activation-0.359
've
Token've
Feature activation-0.517
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.261
that
Token that
Feature activation-0.480
we
Token we
Feature activation-0.250
've
Token've
Feature activation+0.065
been
Token been
Feature activation-0.031
proposing
Token proposing
Feature activation+0.880
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.231
policy
Token policy
Feature activation-0.087
prescriptions
Token prescriptions
Feature activation-0.273
that
Token that
Feature activation-0.092
we
Token we
Feature activation-0.036
've
Token've
Feature activation+0.072
been
Token been
Feature activation-0.030
proposing
Token proposing
Feature activation-0.034
don
Token don
Feature activation-7.340
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.585
perspective
Token perspective
Feature activation-0.035
.
Token.
Feature activation-0.170
"
Token "
Feature activation+0.020
What
TokenWhat
Feature activation-0.157
is
Token is
Feature activation-0.393
true
Token true
Feature activation-0.748
,
Token,
Feature activation-0.382
though
Token though
Feature activation-0.477
<|endoftext|>
Token<|endoftext|>
Feature activation-7.642
perspective
Token perspective
Feature activation-0.093
.
Token.
Feature activation-0.216
"
Token "
Feature activation+0.444
What
TokenWhat
Feature activation-0.390
is
Token is
Feature activation-0.317
true
Token true
Feature activation-0.513
,
Token,
Feature activation-0.562
though
Token though
Feature activation-0.711
perspective
Token perspective
Feature activation-0.215
.
Token.
Feature activation-0.366
"
Token "
Feature activation-0.155
What
TokenWhat
Feature activation-0.481
is
Token is
Feature activation-0.283
true
Token true
Feature activation+0.269
,
Token,
Feature activation-0.495
though
Token though
Feature activation-1.812
,
Token,
Feature activation-0.705
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.489
perspective
Token perspective
Feature activation-0.138
.
Token.
Feature activation-0.393
"
Token "
Feature activation+0.180
What
TokenWhat
Feature activation-0.071
is
Token is
Feature activation-0.124
true
Token true
Feature activation-1.039
,
Token,
Feature activation-0.492
though
Token though
Feature activation-1.205

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.07

Head 2: 0.08

Head 3: 0.09

Head 4: 0.09

Head 5: 0.08

Head 6: 0.09

Head 7: 0.07

Head 8: 0.10

Head 9: 0.10

Head 10: 0.08

Head 11: 0.09

Positive logits

ortium: 3.18

etr: 3.02

vernment: 3.02

weld: 3.01

steril: 2.82

ugu: 2.76

opal: 2.74

regn: 2.73

robots: 2.72

unemploy: 2.72

Romanian: 2.70

pregn: 2.63

sterling: 2.58

¯¯¯¯¯¯¯¯: 2.58

Pug: 2.56

roman: 2.55

henko: 2.55

bots: 2.55

Weld: 2.53

alias: 2.53

Negative logits

Koen: -3.14

Ey: -2.92

awa: -2.91

Mem: -2.80

caches: -2.77

Amin: -2.65

Ghost: -2.63

Mississ: -2.63

Mour: -2.61

Hide: -2.60

Wa: -2.58

Freedom: -2.56

ACY: -2.55

Coff: -2.50

Holiday: -2.49

FO: -2.46

Midnight: -2.46

wrapper: -2.45

Clause: -2.45

Boost: -2.45

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

are
Token are
Feature activation+0.000
declining
Token declining
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
not
Token not
Feature activation+0.000
because
Token because
Feature activation+0.000
people
Token people
Feature activation+0.000
hate
Token hate
Feature activation+0.000
it
Token it
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
and
Token and
Feature activation+0.000
attach
Token attach
Feature activation+0.000
tanks
Token tanks
Feature activation+0.000
,
Token,
Feature activation+0.000
batteries
Token batteries
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
instruments
Token instruments
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
spacecraft
Token spacecraft
Feature activation+0.000
least
Token least
Feature activation+0.000
three
Token three
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
wrong
Token wrong
Feature activation+0.000
suspects
Token suspects
Feature activation+0.000
were
Token were
Feature activation+0.000
charged
Token charged
Feature activation+0.000
,
Token,
Feature activation+0.000
including
Token including
Feature activation+0.000
one
Token one
Feature activation+0.000
As
Token As
Feature activation+0.000
Christians
Token Christians
Feature activation+0.000
we
Token we
Feature activation+0.000
live
Token live
Feature activation+0.000
under
Token under
Feature activation+0.000
the
Token the
Feature activation+0.000
belief
Token belief
Feature activation+0.000
that
Token that
Feature activation+0.000
God
Token God
Feature activation+0.000
is
Token is
Feature activation+0.000
alive
Token alive
Feature activation+0.000
other
Token other
Feature activation+0.000
ways
Token ways
Feature activation+0.000
.
Token.
Feature activation+0.000
We
Token We
Feature activation+0.000
will
Token will
Feature activation+0.000
be
Token be
Feature activation+0.000
monitoring
Token monitoring
Feature activation+0.000
the
Token the
Feature activation+0.000
ban
Token ban
Feature activation+0.000
closely
Token closely
Feature activation+0.000
and
Token and
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 20: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.940

,
Token,
Feature activation-0.045
is
Token is
Feature activation-0.052
that
Token that
Feature activation-0.485
whatever
Token whatever
Feature activation-0.452
policy
Token policy
Feature activation+0.042
prescriptions
Token prescriptions
Feature activation+0.223
that
Token that
Feature activation-0.198
we
Token we
Feature activation+0.013
've
Token've
Feature activation+0.154
been
Token been
Feature activation-0.200
proposing
Token proposing
Feature activation-0.076
<|endoftext|>
Token<|endoftext|>
Feature activation-5.389
perspective
Token perspective
Feature activation-0.067
.
Token.
Feature activation-0.001
"
Token "
Feature activation+0.316
What
TokenWhat
Feature activation-0.496
is
Token is
Feature activation-0.266
true
Token true
Feature activation-0.305
,
Token,
Feature activation+0.068
though
Token though
Feature activation-0.213
prescriptions
Token prescriptions
Feature activation+0.071
that
Token that
Feature activation-0.057
we
Token we
Feature activation-0.045
've
Token've
Feature activation-0.016
been
Token been
Feature activation-0.098
proposing
Token proposing
Feature activation+0.365
don
Token don
Feature activation-0.828
't
Token't
Feature activation-0.382
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.539
,
Token,
Feature activation+0.050
though
Token though
Feature activation-0.265
,
Token,
Feature activation-0.035
is
Token is
Feature activation-0.058
that
Token that
Feature activation+0.286
whatever
Token whatever
Feature activation-0.347
policy
Token policy
Feature activation-0.555
prescriptions
Token prescriptions
Feature activation-0.228
that
Token that
Feature activation-0.046
we
Token we
Feature activation+0.075
<|endoftext|>
Token<|endoftext|>
Feature activation-7.288
perspective
Token perspective
Feature activation-0.122
.
Token.
Feature activation-0.336
"
Token "
Feature activation+0.543
What
TokenWhat
Feature activation-0.394
is
Token is
Feature activation-0.276
true
Token true
Feature activation-0.218
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.470
perspective
Token perspective
Feature activation-0.189
.
Token.
Feature activation-0.135
"
Token "
Feature activation+1.321
What
TokenWhat
Feature activation-0.375
is
Token is
Feature activation-0.422
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.002
perspective
Token perspective
Feature activation-0.379
.
Token.
Feature activation+0.072
"
Token "
Feature activation+1.646
What
TokenWhat
Feature activation-0.814
is
Token is
Feature activation-0.591
true
Token true
Feature activation-0.660
,
Token,
Feature activation+0.129
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.206
.
Token.
Feature activation-0.329
"
Token "
Feature activation-2.443
What
TokenWhat
Feature activation-0.508
is
Token is
Feature activation-0.252
true
Token true
Feature activation+0.442
,
Token,
Feature activation+0.271
though
Token though
Feature activation-1.135
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.075
is
Token is
Feature activation-0.155
that
Token that
Feature activation-0.661
whatever
Token whatever
Feature activation-0.265
policy
Token policy
Feature activation+0.047
prescriptions
Token prescriptions
Feature activation+1.688
that
Token that
Feature activation-0.458
we
Token we
Feature activation-0.114
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.601
perspective
Token perspective
Feature activation-0.093
.
Token.
Feature activation-0.093
"
Token "
Feature activation+0.353
What
TokenWhat
Feature activation-0.132
is
Token is
Feature activation-0.243
true
Token true
Feature activation-0.266
,
Token,
Feature activation+0.067
though
Token though
Feature activation-0.249
<|endoftext|>
Token<|endoftext|>
Feature activation-7.047
perspective
Token perspective
Feature activation-0.101
.
Token.
Feature activation-0.130
"
Token "
Feature activation+0.248
What
TokenWhat
Feature activation+0.042
is
Token is
Feature activation-0.403
true
Token true
Feature activation-0.319
,
Token,
Feature activation+0.072
though
Token though
Feature activation-0.236
<|endoftext|>
Token<|endoftext|>
Feature activation-5.879
perspective
Token perspective
Feature activation-0.075
.
Token.
Feature activation-0.190
"
Token "
Feature activation+0.365
What
TokenWhat
Feature activation-0.079
is
Token is
Feature activation-0.320
true
Token true
Feature activation-0.391
,
Token,
Feature activation+0.033
though
Token though
Feature activation-0.324
,
Token,
Feature activation-0.065
is
Token is
Feature activation-0.045
that
Token that
Feature activation-0.495
whatever
Token whatever
Feature activation-0.267
policy
Token policy
Feature activation+0.237
prescriptions
Token prescriptions
Feature activation+1.778
that
Token that
Feature activation-0.296
we
Token we
Feature activation-0.103
've
Token've
Feature activation+0.001
been
Token been
Feature activation-0.595
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.083
is
Token is
Feature activation-0.108
that
Token that
Feature activation-0.587
whatever
Token whatever
Feature activation-0.289
policy
Token policy
Feature activation+0.297
prescriptions
Token prescriptions
Feature activation+1.940
that
Token that
Feature activation-0.377
we
Token we
Feature activation-0.016
've
Token've
Feature activation-0.363
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.061
is
Token is
Feature activation-0.208
that
Token that
Feature activation-0.781
whatever
Token whatever
Feature activation-0.391
policy
Token policy
Feature activation+0.599
prescriptions
Token prescriptions
Feature activation+0.720
that
Token that
Feature activation-0.134
we
Token we
Feature activation+0.054
've
Token've
Feature activation+0.050
been
Token been
Feature activation-0.136
proposing
Token proposing
Feature activation+0.660
whatever
Token whatever
Feature activation-0.187
policy
Token policy
Feature activation-0.072
prescriptions
Token prescriptions
Feature activation-0.237
that
Token that
Feature activation-0.070
we
Token we
Feature activation+0.005
've
Token've
Feature activation+0.054
been
Token been
Feature activation-0.035
proposing
Token proposing
Feature activation-0.041
don
Token don
Feature activation-4.422
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.499
perspective
Token perspective
Feature activation-0.033
.
Token.
Feature activation-0.076
"
Token "
Feature activation+0.418
What
TokenWhat
Feature activation-0.245
is
Token is
Feature activation-0.410
true
Token true
Feature activation-0.495
,
Token,
Feature activation+0.089
though
Token though
Feature activation-0.220
<|endoftext|>
Token<|endoftext|>
Feature activation-6.482
perspective
Token perspective
Feature activation-0.042
.
Token.
Feature activation-0.052
"
Token "
Feature activation+0.834
What
TokenWhat
Feature activation-0.380
is
Token is
Feature activation-0.268
true
Token true
Feature activation-0.341
,
Token,
Feature activation+0.061
though
Token though
Feature activation-0.329
perspective
Token perspective
Feature activation-0.221
.
Token.
Feature activation-0.229
"
Token "
Feature activation+0.342
What
TokenWhat
Feature activation-0.630
is
Token is
Feature activation-0.368
true
Token true
Feature activation+0.577
,
Token,
Feature activation+0.117
though
Token though
Feature activation-0.942
,
Token,
Feature activation-0.204
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
.
Token.
Feature activation-0.289
"
Token "
Feature activation+0.099
What
TokenWhat
Feature activation-0.335
is
Token is
Feature activation-0.224
true
Token true
Feature activation-0.772
,
Token,
Feature activation+0.216
though
Token though
Feature activation-0.552
,
Token,
Feature activation-0.068
is
Token is
Feature activation-0.292
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.08

Head 2: 0.08

Head 3: 0.09

Head 4: 0.09

Head 5: 0.08

Head 6: 0.07

Head 7: 0.09

Head 8: 0.08

Head 9: 0.09

Head 10: 0.08

Head 11: 0.09

Positive logits

Kus 3.35

uld 3.07

------ 2.91

Kuwait 2.78

UV 2.73

phosph 2.71

sa 2.67

Kush 2.56

activated 2.53

LC 2.51

サーティワン 2.49

ع 2.41

Plasma 2.38

mus 2.37

[+] 2.33

enium 2.32

MJ 2.32

ihad 2.30

aters 2.29

Saiyan 2.29

Negative logits

ransom -2.80

predic -2.55

oin -2.50

pigeon -2.45

nails -2.38

ugu -2.38

dogs -2.37

nom -2.34

uture -2.34

tenure -2.30

snap -2.30

pick -2.28

outper -2.28

standard -2.27

fares -2.25

ailable -2.24

ispers -2.23

oice -2.22

appointments -2.22

Ding -2.22

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

a
Token a
Feature activation+0.000
move
Token move
Feature activation+0.000
sure
Token sure
Feature activation+0.000
to
Token to
Feature activation+0.000
shock
Token shock
Feature activation+0.000
the
Token the
Feature activation+0.000
auto
Token auto
Feature activation+0.000
world
Token world
Feature activation+0.000
,
Token,
Feature activation+0.000
Motor
Token Motor
Feature activation+0.000
Trend
Token Trend
Feature activation+0.000
a
Token a
Feature activation+0.000
set
Token set
Feature activation+0.000
of
Token of
Feature activation+0.000
twins
Token twins
Feature activation+0.000
,
Token,
Feature activation+0.000
and
Token and
Feature activation+0.000
Ms
Token Ms
Feature activation+0.000
Gall
Token Gall
Feature activation+0.000
oway
Tokenoway
Feature activation+0.000
said
Token said
Feature activation+0.000
no
Token no
Feature activation+0.000
the
Token the
Feature activation+0.000
item
Token item
Feature activation+0.000
cannot
Token cannot
Feature activation+0.000
be
Token be
Feature activation+0.000
traded
Token traded
Feature activation+0.000
once
Token once
Feature activation+0.000
a
Token a
Feature activation+0.000
player
Token player
Feature activation+0.000
picks
Token picks
Feature activation+0.000
it
Token it
Feature activation+0.000
up
Token up
Feature activation+0.000
:
Token:
Feature activation+0.000
Gil
Token Gil
Feature activation+0.000
Brand
Token Brand
Feature activation+0.000
t
Tokent
Feature activation+0.000
,
Token,
Feature activation+0.000
B
Token B
Feature activation+0.000
ucky
Tokenucky
Feature activation+0.000
Brooks
Token Brooks
Feature activation+0.000
,
Token,
Feature activation+0.000
Char
Token Char
Feature activation+0.000
ley
Tokenley
Feature activation+0.000
and
Token and
Feature activation+0.000
to
Token to
Feature activation+0.000
cover
Token cover
Feature activation+0.000
food
Token food
Feature activation+0.000
,
Token,
Feature activation+0.000
clothing
Token clothing
Feature activation+0.000
,
Token,
Feature activation+0.000
housing
Token housing
Feature activation+0.000
,
Token,
Feature activation+0.000
education
Token education
Feature activation+0.000
and
Token and
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 21: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.735

whatever
Token whatever
Feature activation-0.386
policy
Token policy
Feature activation+0.034
prescriptions
Token prescriptions
Feature activation-0.175
that
Token that
Feature activation-0.137
we
Token we
Feature activation-0.138
've
Token've
Feature activation+0.081
been
Token been
Feature activation-0.241
proposing
Token proposing
Feature activation-0.173
don
Token don
Feature activation-0.152
't
Token't
Feature activation-0.151
reach
Token reach
Feature activation-3.111
<|endoftext|>
Token<|endoftext|>
Feature activation-6.146
perspective
Token perspective
Feature activation-0.023
.
Token.
Feature activation-0.013
"
Token "
Feature activation+0.072
What
TokenWhat
Feature activation-0.546
is
Token is
Feature activation-0.267
true
Token true
Feature activation-0.621
,
Token,
Feature activation-0.172
though
Token though
Feature activation-0.269
prescriptions
Token prescriptions
Feature activation-0.006
that
Token that
Feature activation-0.077
we
Token we
Feature activation-0.230
've
Token've
Feature activation-0.101
been
Token been
Feature activation-0.139
proposing
Token proposing
Feature activation+0.147
don
Token don
Feature activation-1.578
't
Token't
Feature activation-0.338
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.800
,
Token,
Feature activation-0.061
though
Token though
Feature activation-0.370
,
Token,
Feature activation-0.112
is
Token is
Feature activation-0.182
that
Token that
Feature activation+0.063
whatever
Token whatever
Feature activation-0.461
policy
Token policy
Feature activation-0.510
prescriptions
Token prescriptions
Feature activation-0.319
that
Token that
Feature activation-0.092
we
Token we
Feature activation-0.091
<|endoftext|>
Token<|endoftext|>
Feature activation-8.195
perspective
Token perspective
Feature activation-0.096
.
Token.
Feature activation-0.131
"
Token "
Feature activation+0.394
What
TokenWhat
Feature activation-0.590
is
Token is
Feature activation-0.401
true
Token true
Feature activation-1.066
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.391
perspective
Token perspective
Feature activation-0.068
.
Token.
Feature activation-0.139
"
Token "
Feature activation+0.735
What
TokenWhat
Feature activation-0.509
is
Token is
Feature activation-0.411
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.854
perspective
Token perspective
Feature activation-0.231
.
Token.
Feature activation+0.062
"
Token "
Feature activation+0.733
What
TokenWhat
Feature activation-0.997
is
Token is
Feature activation-0.573
true
Token true
Feature activation-1.727
,
Token,
Feature activation-0.253
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.458
perspective
Token perspective
Feature activation-0.127
.
Token.
Feature activation+0.229
"
Token "
Feature activation-1.137
What
TokenWhat
Feature activation-0.541
is
Token is
Feature activation-0.223
true
Token true
Feature activation-1.122
,
Token,
Feature activation+0.051
,
Token,
Feature activation-0.254
is
Token is
Feature activation-0.207
that
Token that
Feature activation-0.442
whatever
Token whatever
Feature activation-0.286
policy
Token policy
Feature activation-0.121
prescriptions
Token prescriptions
Feature activation+0.392
that
Token that
Feature activation-0.378
we
Token we
Feature activation-0.422
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
though
Token though
Feature activation-0.315
,
Token,
Feature activation-0.130
is
Token is
Feature activation-0.285
that
Token that
Feature activation-0.148
whatever
Token whatever
Feature activation-0.412
policy
Token policy
Feature activation+0.221
prescriptions
Token prescriptions
Feature activation-0.310
that
Token that
Feature activation-0.013
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
though
Token though
Feature activation-0.301
,
Token,
Feature activation-0.117
is
Token is
Feature activation-0.232
that
Token that
Feature activation-0.325
whatever
Token whatever
Feature activation-0.328
policy
Token policy
Feature activation+0.206
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
though
Token though
Feature activation-0.417
,
Token,
Feature activation-0.177
is
Token is
Feature activation-0.293
that
Token that
Feature activation+0.133
whatever
Token whatever
Feature activation-0.671
policy
Token policy
Feature activation+0.434
prescriptions
Token prescriptions
Feature activation-0.215
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation-0.186
is
Token is
Feature activation-0.128
that
Token that
Feature activation-0.321
whatever
Token whatever
Feature activation-0.304
policy
Token policy
Feature activation-0.020
prescriptions
Token prescriptions
Feature activation+0.430
that
Token that
Feature activation-0.321
we
Token we
Feature activation-0.372
've
Token've
Feature activation-0.207
been
Token been
Feature activation-0.809
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.146
is
Token is
Feature activation-0.177
that
Token that
Feature activation-0.467
whatever
Token whatever
Feature activation-0.334
policy
Token policy
Feature activation+0.022
prescriptions
Token prescriptions
Feature activation+0.575
that
Token that
Feature activation-0.410
we
Token we
Feature activation-0.201
've
Token've
Feature activation-0.449
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
though
Token though
Feature activation-0.445
,
Token,
Feature activation-0.225
is
Token is
Feature activation-0.280
that
Token that
Feature activation-0.719
whatever
Token whatever
Feature activation-0.431
policy
Token policy
Feature activation+0.477
prescriptions
Token prescriptions
Feature activation+0.324
that
Token that
Feature activation+0.005
we
Token we
Feature activation-0.151
've
Token've
Feature activation-0.081
been
Token been
Feature activation-0.173
whatever
Token whatever
Feature activation-0.237
policy
Token policy
Feature activation-0.085
prescriptions
Token prescriptions
Feature activation-0.294
that
Token that
Feature activation-0.063
we
Token we
Feature activation-0.019
've
Token've
Feature activation+0.075
been
Token been
Feature activation-0.038
proposing
Token proposing
Feature activation-0.039
don
Token don
Feature activation-10.976
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.236
perspective
Token perspective
Feature activation-0.004
.
Token.
Feature activation-0.050
"
Token "
Feature activation+0.166
What
TokenWhat
Feature activation-0.239
is
Token is
Feature activation-0.397
true
Token true
Feature activation-0.855
,
Token,
Feature activation-0.166
though
Token though
Feature activation-0.285
<|endoftext|>
Token<|endoftext|>
Feature activation-7.312
perspective
Token perspective
Feature activation-0.023
.
Token.
Feature activation+0.009
"
Token "
Feature activation+0.437
What
TokenWhat
Feature activation-0.303
is
Token is
Feature activation-0.229
true
Token true
Feature activation-0.766
,
Token,
Feature activation-0.142
though
Token though
Feature activation-0.442
<|endoftext|>
Token<|endoftext|>
Feature activation-7.071
perspective
Token perspective
Feature activation-0.173
.
Token.
Feature activation-0.003
"
Token "
Feature activation+0.156
What
TokenWhat
Feature activation-0.686
is
Token is
Feature activation-0.457
true
Token true
Feature activation-1.098
,
Token,
Feature activation-0.134
though
Token though
Feature activation-1.205
<|endoftext|>
Token<|endoftext|>
Feature activation-7.149
perspective
Token perspective
Feature activation-0.049
.
Token.
Feature activation-0.012
"
Token "
Feature activation+0.038
What
TokenWhat
Feature activation-0.511
is
Token is
Feature activation-0.261
true
Token true
Feature activation-1.693
,
Token,
Feature activation-0.197
though
Token though
Feature activation-0.751

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.08

Head 2: 0.09

Head 3: 0.08

Head 4: 0.07

Head 5: 0.09

Head 6: 0.08

Head 7: 0.08

Head 8: 0.08

Head 9: 0.09

Head 10: 0.08

Head 11: 0.09

Positive logits

Wal 3.25

igham 3.20

cro 3.09

Giuliani 3.08

Huckabee 2.95

Weaver 2.86

ocrates 2.74

venants 2.73

Monk 2.70

Michelle 2.69

Mills 2.67

arella 2.60

robber 2.59

Rite 2.56

berman 2.55

Eisenhower 2.55

Sanders 2.52

Gideon 2.52

Sanders 2.51

McMahon 2.51

Negative logits

alys -3.87

incent -3.13

phon -2.89

Catal -2.89

CN -2.82

apy -2.77

-2.74

hon -2.66

Ake -2.65

Qiao -2.65

dam -2.61

usercontent -2.61

wreck -2.60

ensional -2.60

Crimean -2.57

Contribut -2.56

modelling -2.52

ğ -2.46

Remix -2.46

Creat -2.44

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

memories
Token memories
Feature activation+0.000
you
Token you
Feature activation+0.000
make
Token make
Feature activation+0.000
on
Token on
Feature activation+0.000
the
Token the
Feature activation+0.000
go
Token go
Feature activation+0.000
Un
Token Un
Feature activation+0.000
official
Tokenofficial
Feature activation+0.000
Ch
Token Ch
Feature activation+0.000
angel
Tokenangel
Feature activation+0.000
og
Tokenog
Feature activation+0.000
nights
Token nights
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Yes
TokenYes
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
Harbour
Token Harbour
Feature activation+0.000
front
Tokenfront
Feature activation+0.000
Centre
Token Centre
Feature activation+0.000
's
Token's
Feature activation+0.000
businessmen
Token businessmen
Feature activation+0.000
Tom
Token Tom
Feature activation+0.000
G
Token G
Feature activation+0.000
ores
Tokenores
Feature activation+0.000
and
Token and
Feature activation+0.000
Dan
Token Dan
Feature activation+0.000
Gilbert
Token Gilbert
Feature activation+0.000
to
Token to
Feature activation+0.000
build
Token build
Feature activation+0.000
a
Token a
Feature activation+0.000
soccer
Token soccer
Feature activation+0.000
For
Token For
Feature activation+0.000
$
Token $
Feature activation+0.000
100
Token100
Feature activation+0.000
per
Token per
Feature activation+0.000
month
Token month
Feature activation+0.000
,
Token,
Feature activation+0.000
customers
Token customers
Feature activation+0.000
receive
Token receive
Feature activation+0.000
a
Token a
Feature activation+0.000
physical
Token physical
Feature activation+0.000
box
Token box
Feature activation+0.000
now
Token now
Feature activation+0.000
endorsing
Token endorsing
Feature activation+0.000
the
Token the
Feature activation+0.000
current
Token current
Feature activation+0.000
work
Token work
Feature activation+0.000
by
Token by
Feature activation+0.000
Mr
Token Mr
Feature activation+0.000
Ai
Token Ai
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 22: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.751

<|endoftext|>
Token<|endoftext|>
Feature activation-4.190
perspective
Token perspective
Feature activation-0.026
.
Token.
Feature activation-0.146
"
Token "
Feature activation+0.072
What
TokenWhat
Feature activation-0.273
is
Token is
Feature activation-0.213
true
Token true
Feature activation-0.755
,
Token,
Feature activation-0.052
though
Token though
Feature activation-0.074
<|endoftext|>
Token<|endoftext|>
Feature activation-4.015
perspective
Token perspective
Feature activation+0.002
.
Token.
Feature activation-0.081
"
Token "
Feature activation+0.100
What
TokenWhat
Feature activation-0.323
is
Token is
Feature activation-0.209
true
Token true
Feature activation-0.413
,
Token,
Feature activation-0.032
though
Token though
Feature activation-0.145
prescriptions
Token prescriptions
Feature activation-0.044
that
Token that
Feature activation-0.148
we
Token we
Feature activation-0.001
've
Token've
Feature activation-0.065
been
Token been
Feature activation-0.083
proposing
Token proposing
Feature activation+0.297
don
Token don
Feature activation-0.657
't
Token't
Feature activation-0.470
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.563
,
Token,
Feature activation-0.104
though
Token though
Feature activation-0.240
,
Token,
Feature activation-0.070
is
Token is
Feature activation-0.029
that
Token that
Feature activation+0.107
whatever
Token whatever
Feature activation-0.288
policy
Token policy
Feature activation-0.323
prescriptions
Token prescriptions
Feature activation-0.149
that
Token that
Feature activation-0.079
we
Token we
Feature activation-0.059
<|endoftext|>
Token<|endoftext|>
Feature activation-5.359
perspective
Token perspective
Feature activation+0.129
.
Token.
Feature activation-0.427
"
Token "
Feature activation+0.266
What
TokenWhat
Feature activation-0.445
is
Token is
Feature activation-0.213
true
Token true
Feature activation-0.366
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.513
perspective
Token perspective
Feature activation+0.043
.
Token.
Feature activation-0.405
"
Token "
Feature activation+0.701
What
TokenWhat
Feature activation-0.429
is
Token is
Feature activation-0.398
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.485
perspective
Token perspective
Feature activation-0.039
.
Token.
Feature activation-0.243
"
Token "
Feature activation+0.751
What
TokenWhat
Feature activation-0.565
is
Token is
Feature activation-0.484
true
Token true
Feature activation-0.990
,
Token,
Feature activation-0.198
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.010
.
Token.
Feature activation-0.485
"
Token "
Feature activation-1.240
What
TokenWhat
Feature activation-0.485
is
Token is
Feature activation-0.297
true
Token true
Feature activation+0.256
,
Token,
Feature activation-0.236
though
Token though
Feature activation-0.975
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.049
is
Token is
Feature activation-0.109
that
Token that
Feature activation-0.590
whatever
Token whatever
Feature activation-0.230
policy
Token policy
Feature activation-0.146
prescriptions
Token prescriptions
Feature activation+0.248
that
Token that
Feature activation-0.506
we
Token we
Feature activation-0.244
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.874
perspective
Token perspective
Feature activation+0.010
.
Token.
Feature activation-0.202
"
Token "
Feature activation+0.136
What
TokenWhat
Feature activation-0.110
is
Token is
Feature activation-0.153
true
Token true
Feature activation-0.249
,
Token,
Feature activation-0.039
though
Token though
Feature activation-0.191
<|endoftext|>
Token<|endoftext|>
Feature activation-5.185
perspective
Token perspective
Feature activation-0.003
.
Token.
Feature activation-0.222
"
Token "
Feature activation+0.056
What
TokenWhat
Feature activation-0.055
is
Token is
Feature activation-0.322
true
Token true
Feature activation-0.325
,
Token,
Feature activation-0.032
though
Token though
Feature activation-0.191
<|endoftext|>
Token<|endoftext|>
Feature activation-4.373
perspective
Token perspective
Feature activation-0.015
.
Token.
Feature activation-0.313
"
Token "
Feature activation+0.056
What
TokenWhat
Feature activation-0.111
is
Token is
Feature activation-0.231
true
Token true
Feature activation-0.493
,
Token,
Feature activation-0.097
though
Token though
Feature activation-0.266
,
Token,
Feature activation-0.067
is
Token is
Feature activation-0.034
that
Token that
Feature activation-0.435
whatever
Token whatever
Feature activation-0.226
policy
Token policy
Feature activation-0.112
prescriptions
Token prescriptions
Feature activation+0.308
that
Token that
Feature activation-0.384
we
Token we
Feature activation-0.119
've
Token've
Feature activation-0.220
been
Token been
Feature activation-0.478
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.081
is
Token is
Feature activation-0.056
that
Token that
Feature activation-0.498
whatever
Token whatever
Feature activation-0.238
policy
Token policy
Feature activation+0.006
prescriptions
Token prescriptions
Feature activation+0.432
that
Token that
Feature activation-0.467
we
Token we
Feature activation-0.154
've
Token've
Feature activation-0.431
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.163
that
Token that
Feature activation-0.317
we
Token we
Feature activation-0.217
've
Token've
Feature activation-0.088
been
Token been
Feature activation-0.117
proposing
Token proposing
Feature activation+0.425
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.184
policy
Token policy
Feature activation-0.056
prescriptions
Token prescriptions
Feature activation-0.223
that
Token that
Feature activation-0.066
we
Token we
Feature activation-0.021
've
Token've
Feature activation+0.074
been
Token been
Feature activation-0.030
proposing
Token proposing
Feature activation-0.051
don
Token don
Feature activation-4.491
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-4.828
perspective
Token perspective
Feature activation+0.103
.
Token.
Feature activation-0.264
"
Token "
Feature activation+0.194
What
TokenWhat
Feature activation-0.249
is
Token is
Feature activation-0.307
true
Token true
Feature activation-0.546
,
Token,
Feature activation-0.103
though
Token though
Feature activation-0.183
<|endoftext|>
Token<|endoftext|>
Feature activation-4.818
perspective
Token perspective
Feature activation+0.119
.
Token.
Feature activation-0.241
"
Token "
Feature activation+0.382
What
TokenWhat
Feature activation-0.285
is
Token is
Feature activation-0.199
true
Token true
Feature activation-0.442
,
Token,
Feature activation-0.112
though
Token though
Feature activation-0.272
<|endoftext|>
Token<|endoftext|>
Feature activation-4.614
perspective
Token perspective
Feature activation+0.209
.
Token.
Feature activation-0.356
"
Token "
Feature activation+0.086
What
TokenWhat
Feature activation-0.620
is
Token is
Feature activation-0.291
true
Token true
Feature activation+0.191
<|endoftext|>
Token<|endoftext|>
Feature activation-4.672
perspective
Token perspective
Feature activation+0.093
.
Token.
Feature activation-0.391
"
Token "
Feature activation+0.003
What
TokenWhat
Feature activation-0.238
is
Token is
Feature activation-0.218
true
Token true
Feature activation-0.881

Decoder Weights Distribution

Head 0: 0.07
Head 1: 0.07
Head 2: 0.07
Head 3: 0.09
Head 4: 0.08
Head 5: 0.10
Head 6: 0.09
Head 7: 0.08
Head 8: 0.09
Head 9: 0.09
Head 10: 0.09
Head 11: 0.08

Positive logits

blem 3.18
wo 2.82
Rivals 2.79
Shipping 2.78
shadow 2.74
Pose 2.73
Ripple 2.65
Shant 2.64
etition 2.62
IPS 2.61
Coin 2.60
rud 2.60
ripp 2.58
Seah 2.58
stacked 2.52
hang 2.51
itter 2.48
rival 2.47
antle 2.44
Saints 2.44

Negative logits

Downloadha -2.93
Python -2.91
Libyan -2.88
Libre -2.72
clitor -2.69
Mot -2.67
Tunis -2.59
kb -2.58
actionDate -2.58
Berks -2.58
French -2.57
adic -2.53
princip -2.51
Jordanian -2.51
Yemeni -2.50
Qur -2.50
Somali -2.47
French -2.46
CVE -2.43
vec -2.41

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

ood
Tokenood
Feature activation+0.000
les
Tokenles
Feature activation+0.000
British
Token British
Feature activation+0.000
Gin
Token Gin
Feature activation+0.000
is
Token is
Feature activation+0.000
a
Token a
Feature activation+0.000
brand
Token brand
Feature activation+0.000
of
Token of
Feature activation+0.000
gin
Token gin
Feature activation+0.000
bottled
Token bottled
Feature activation+0.000
and
Token and
Feature activation+0.000
have
Token have
Feature activation+0.000
recently
Token recently
Feature activation+0.000
been
Token been
Feature activation+0.000
proposed
Token proposed
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000
surrounding
Token surrounding
Feature activation+0.000
area
Token area
Feature activation+0.000
(
Token (
Feature activation+0.000
see
Tokensee
Feature activation+0.000
here
Token here
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Law
TokenLaw
Feature activation+0.000
rence
Tokenrence
Feature activation+0.000
Keane
Token Keane
Feature activation+0.000
,
Token,
Feature activation+0.000
N
Token N
Feature activation+0.000
SS
TokenSS
Feature activation+0.000
F
TokenF
Feature activation+0.000
Black
Token Black
Feature activation+0.000
River
Token River
Feature activation+0.000
Falls
Token Falls
Feature activation+0.000
-
Token-
Feature activation+0.000
based
Tokenbased
Feature activation+0.000
Hoffman
Token Hoffman
Feature activation+0.000
Construction
Token Construction
Feature activation+0.000
Co
Token Co
Feature activation+0.000
.,
Token.,
Feature activation+0.000
which
Token which
Feature activation+0.000
works
Token works
Feature activation+0.000
him
Token him
Feature activation+0.000
from
Token from
Feature activation+0.000
realizing
Token realizing
Feature activation+0.000
his
Token his
Feature activation+0.000
dreams
Token dreams
Feature activation+0.000
.
Token.
Feature activation+0.000
To
Token To
Feature activation+0.000
honor
Token honor
Feature activation+0.000
his
Token his
Feature activation+0.000
memory
Token memory
Feature activation+0.000
,
Token,
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 23: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 1.398

<|endoftext|>
Token<|endoftext|>
Feature activation-6.534
perspective
Token perspective
Feature activation-0.087
.
Token.
Feature activation-0.151
"
Token "
Feature activation+0.092
What
TokenWhat
Feature activation-0.356
is
Token is
Feature activation-0.457
true
Token true
Feature activation-1.222
,
Token,
Feature activation-0.027
though
Token though
Feature activation-0.174
<|endoftext|>
Token<|endoftext|>
Feature activation-6.330
perspective
Token perspective
Feature activation-0.066
.
Token.
Feature activation-0.007
"
Token "
Feature activation+0.231
What
TokenWhat
Feature activation-0.524
is
Token is
Feature activation-0.426
true
Token true
Feature activation-0.770
,
Token,
Feature activation-0.037
though
Token though
Feature activation-0.336
prescriptions
Token prescriptions
Feature activation-0.043
that
Token that
Feature activation-0.271
we
Token we
Feature activation-0.053
've
Token've
Feature activation-0.100
been
Token been
Feature activation-0.126
proposing
Token proposing
Feature activation+0.436
don
Token don
Feature activation-1.032
't
Token't
Feature activation-0.721
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.493
perspective
Token perspective
Feature activation-0.069
.
Token.
Feature activation-0.068
"
Token "
Feature activation+0.051
What
TokenWhat
Feature activation-0.083
is
Token is
Feature activation-0.418
true
Token true
Feature activation-1.043
,
Token,
Feature activation-0.127
though
Token though
Feature activation-0.501
<|endoftext|>
Token<|endoftext|>
Feature activation-8.392
perspective
Token perspective
Feature activation-0.202
.
Token.
Feature activation-0.165
"
Token "
Feature activation+0.392
What
TokenWhat
Feature activation-0.141
is
Token is
Feature activation-0.513
true
Token true
Feature activation-0.815
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.564
perspective
Token perspective
Feature activation-0.191
.
Token.
Feature activation-0.119
"
Token "
Feature activation+0.982
What
TokenWhat
Feature activation-0.203
is
Token is
Feature activation-0.904
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.979
perspective
Token perspective
Feature activation-0.442
.
Token.
Feature activation+0.081
"
Token "
Feature activation+1.398
What
TokenWhat
Feature activation-0.752
is
Token is
Feature activation-0.986
true
Token true
Feature activation-1.954
,
Token,
Feature activation-0.252
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.649
perspective
Token perspective
Feature activation-0.215
.
Token.
Feature activation+0.508
"
Token "
Feature activation-0.912
What
TokenWhat
Feature activation-0.320
is
Token is
Feature activation-0.552
true
Token true
Feature activation+0.137
,
Token,
Feature activation-0.078
<|endoftext|>
Token<|endoftext|>
Feature activation-8.372
perspective
Token perspective
Feature activation-0.003
.
Token.
Feature activation-0.057
"
Token "
Feature activation+0.126
What
TokenWhat
Feature activation-0.143
is
Token is
Feature activation-0.378
true
Token true
Feature activation-0.254
,
Token,
Feature activation-0.023
though
Token though
Feature activation-0.178
<|endoftext|>
Token<|endoftext|>
Feature activation-7.636
perspective
Token perspective
Feature activation-0.061
.
Token.
Feature activation-0.046
"
Token "
Feature activation+0.253
What
TokenWhat
Feature activation-0.100
is
Token is
Feature activation-0.369
true
Token true
Feature activation-0.595
,
Token,
Feature activation-0.082
though
Token though
Feature activation-0.431
<|endoftext|>
Token<|endoftext|>
Feature activation-8.086
perspective
Token perspective
Feature activation-0.101
.
Token.
Feature activation-0.155
"
Token "
Feature activation+0.145
What
TokenWhat
Feature activation+0.058
is
Token is
Feature activation-0.660
true
Token true
Feature activation-0.734
,
Token,
Feature activation-0.072
though
Token though
Feature activation-0.439
<|endoftext|>
Token<|endoftext|>
Feature activation-6.820
perspective
Token perspective
Feature activation-0.073
.
Token.
Feature activation-0.189
"
Token "
Feature activation+0.293
What
TokenWhat
Feature activation-0.115
is
Token is
Feature activation-0.552
true
Token true
Feature activation-0.959
,
Token,
Feature activation-0.162
though
Token though
Feature activation-0.594
<|endoftext|>
Token<|endoftext|>
Feature activation-8.344
perspective
Token perspective
Feature activation-0.049
.
Token.
Feature activation-0.126
"
Token "
Feature activation+0.034
What
TokenWhat
Feature activation-0.149
is
Token is
Feature activation-0.174
true
Token true
Feature activation-0.268
,
Token,
Feature activation-0.029
though
Token though
Feature activation-0.132
<|endoftext|>
Token<|endoftext|>
Feature activation-8.375
perspective
Token perspective
Feature activation-0.029
.
Token.
Feature activation-0.131
"
Token "
Feature activation+0.042
What
TokenWhat
Feature activation-0.107
is
Token is
Feature activation-0.234
true
Token true
Feature activation-0.269
,
Token,
Feature activation-0.035
though
Token though
Feature activation-0.234
prescriptions
Token prescriptions
Feature activation-0.020
that
Token that
Feature activation-0.382
we
Token we
Feature activation-0.156
've
Token've
Feature activation-0.072
been
Token been
Feature activation-0.129
proposing
Token proposing
Feature activation+0.734
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.295
policy
Token policy
Feature activation-0.079
prescriptions
Token prescriptions
Feature activation-0.277
that
Token that
Feature activation-0.083
we
Token we
Feature activation-0.010
've
Token've
Feature activation+0.065
been
Token been
Feature activation-0.043
proposing
Token proposing
Feature activation-0.046
don
Token don
Feature activation-6.915
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.527
perspective
Token perspective
Feature activation-0.011
.
Token.
Feature activation-0.004
"
Token "
Feature activation+0.313
What
TokenWhat
Feature activation-0.259
is
Token is
Feature activation-0.733
true
Token true
Feature activation-1.159
,
Token,
Feature activation-0.121
though
Token though
Feature activation-0.404
<|endoftext|>
Token<|endoftext|>
Feature activation-7.541
perspective
Token perspective
Feature activation-0.105
.
Token.
Feature activation+0.014
"
Token "
Feature activation+0.714
What
TokenWhat
Feature activation-0.373
is
Token is
Feature activation-0.565
true
Token true
Feature activation-0.883
,
Token,
Feature activation-0.184
though
Token though
Feature activation-0.586
<|endoftext|>
Token<|endoftext|>
Feature activation-7.211
perspective
Token perspective
Feature activation-0.285
.
Token.
Feature activation-0.042
"
Token "
Feature activation+0.317
What
TokenWhat
Feature activation-0.531
is
Token is
Feature activation-0.566
true
Token true
Feature activation-0.316
,
Token,
Feature activation-0.052
though
Token though
Feature activation-1.513
<|endoftext|>
Token<|endoftext|>
Feature activation-7.273
perspective
Token perspective
Feature activation-0.110
.
Token.
Feature activation+0.166
"
Token "
Feature activation+0.356
What
TokenWhat
Feature activation-0.243
is
Token is
Feature activation-0.440
true
Token true
Feature activation-1.565
,
Token,
Feature activation-0.093
though
Token though
Feature activation-0.981

Decoder Weights Distribution

Head 0: 0.07

Head 1: 0.07

Head 2: 0.07

Head 3: 0.09

Head 4: 0.09

Head 5: 0.08

Head 6: 0.09

Head 7: 0.09

Head 8: 0.09

Head 9: 0.09

Head 10: 0.09

Head 11: 0.08

Positive logits

Flags 2.98

PLA 2.92

pestic 2.76

rex 2.72

ebus 2.61

Anim 2.59

FG 2.58

blo 2.56

asers 2.56

mascul 2.54

isible 2.54

guns 2.53

MU 2.53

BLM 2.50

NI 2.49

Scouting 2.48

disabled 2.48

CU 2.46

Pyro 2.46

retard 2.45

Negative logits

tsky -2.69

intervals -2.61

*/( -2.57

deep -2.46

ibaba -2.44

rition -2.43

Borg -2.42

Trial -2.41

Mald -2.40

oled -2.39

aceutical -2.38

ghai -2.34

Dh -2.34

Pwr -2.32

Mart -2.31

trial -2.31

ixon -2.28

ebted -2.28

itime -2.27

Mart -2.27

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

4
Token4
Feature activation+0.000
,
Token,
Feature activation+0.000
500
Token500
Feature activation+0.000
a
Token a
Feature activation+0.000
month
Token month
Feature activation+0.000
for
Token for
Feature activation+0.000
rent
Token rent
Feature activation+0.000
and
Token and
Feature activation+0.000
$
Token $
Feature activation+0.000
6
Token6
Feature activation+0.000
,
Token,
Feature activation+0.000
government
Token government
Feature activation+0.000
has
Token has
Feature activation+0.000
continued
Token continued
Feature activation+0.000
its
Token its
Feature activation+0.000
support
Token support
Feature activation+0.000
for
Token for
Feature activation+0.000
the
Token the
Feature activation+0.000
Lebanese
Token Lebanese
Feature activation+0.000
military
Token military
Feature activation+0.000
,
Token,
Feature activation+0.000
which
Token which
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
really
Tokenreally
Feature activation+0.000
painful
Token painful
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
My
TokenMy
Feature activation+0.000
anmar
Tokenanmar
Feature activation+0.000
criminal
Token criminal
Feature activation+0.000
justice
Token justice
Feature activation+0.000
at
Token at
Feature activation+0.000
Michigan
Token Michigan
Feature activation+0.000
State
Token State
Feature activation+0.000
.
Token.
Feature activation+0.000
The
Token The
Feature activation+0.000
study
Token study
Feature activation+0.000
,
Token,
Feature activation+0.000
published
Token published
Feature activation+0.000
online
Token online
Feature activation+0.000
have
Token have
Feature activation+0.000
dropped
Token dropped
Feature activation+0.000
science
Token science
Feature activation+0.000
at
Token at
Feature activation+0.000
16
Token 16
Feature activation+0.000
.
Token.
Feature activation+0.000
That
Token That
Feature activation+0.000
is
Token is
Feature activation+0.000
just
Token just
Feature activation+0.000
a
Token a
Feature activation+0.000
fact
Token fact
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 24: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.430

whatever
Token whatever
Feature activation-0.276
policy
Token policy
Feature activation-0.166
prescriptions
Token prescriptions
Feature activation-0.208
that
Token that
Feature activation-0.143
we
Token we
Feature activation-0.091
've
Token've
Feature activation+0.009
been
Token been
Feature activation-0.035
proposing
Token proposing
Feature activation-0.131
don
Token don
Feature activation-0.064
't
Token't
Feature activation-0.109
reach
Token reach
Feature activation-0.805
.
Token.
Feature activation-0.026
"
Token "
Feature activation+0.018
What
TokenWhat
Feature activation-0.271
is
Token is
Feature activation-0.134
true
Token true
Feature activation-0.345
,
Token,
Feature activation+0.037
though
Token though
Feature activation-0.100
,
Token,
Feature activation-0.040
is
Token is
Feature activation-0.045
that
Token that
Feature activation-0.064
whatever
Token whatever
Feature activation-0.320
prescriptions
Token prescriptions
Feature activation-0.057
that
Token that
Feature activation-0.076
we
Token we
Feature activation+0.031
've
Token've
Feature activation+0.065
been
Token been
Feature activation+0.037
proposing
Token proposing
Feature activation+0.067
don
Token don
Feature activation-0.140
't
Token't
Feature activation-0.105
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.451
,
Token,
Feature activation-0.060
though
Token though
Feature activation-0.149
,
Token,
Feature activation-0.064
is
Token is
Feature activation+0.030
that
Token that
Feature activation+0.162
whatever
Token whatever
Feature activation-0.194
policy
Token policy
Feature activation-0.301
prescriptions
Token prescriptions
Feature activation-0.253
that
Token that
Feature activation-0.088
we
Token we
Feature activation-0.047
<|endoftext|>
Token<|endoftext|>
Feature activation-2.123
perspective
Token perspective
Feature activation-0.138
.
Token.
Feature activation-0.115
"
Token "
Feature activation+0.071
What
TokenWhat
Feature activation-0.209
is
Token is
Feature activation-0.104
true
Token true
Feature activation-0.539
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-2.169
perspective
Token perspective
Feature activation-0.162
.
Token.
Feature activation-0.066
"
Token "
Feature activation+0.333
What
TokenWhat
Feature activation-0.221
is
Token is
Feature activation-0.231
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.812
perspective
Token perspective
Feature activation-0.234
.
Token.
Feature activation-0.056
"
Token "
Feature activation+0.430
What
TokenWhat
Feature activation-0.515
is
Token is
Feature activation-0.293
true
Token true
Feature activation-0.857
,
Token,
Feature activation-0.052
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.911
perspective
Token perspective
Feature activation-0.122
.
Token.
Feature activation+0.023
"
Token "
Feature activation-0.136
What
TokenWhat
Feature activation-0.323
is
Token is
Feature activation-0.151
true
Token true
Feature activation-0.225
,
Token,
Feature activation-0.068
<|endoftext|>
Token<|endoftext|>
Feature activation-2.084
perspective
Token perspective
Feature activation-0.019
.
Token.
Feature activation-0.000
"
Token "
Feature activation+0.043
What
TokenWhat
Feature activation-0.066
is
Token is
Feature activation-0.090
true
Token true
Feature activation-0.124
,
Token,
Feature activation+0.009
though
Token though
Feature activation-0.052
is
Token is
Feature activation-0.088
true
Token true
Feature activation-0.258
,
Token,
Feature activation-0.034
though
Token though
Feature activation-0.124
,
Token,
Feature activation-0.066
is
Token is
Feature activation+0.048
that
Token that
Feature activation-0.021
whatever
Token whatever
Feature activation-0.149
policy
Token policy
Feature activation-0.504
prescriptions
Token prescriptions
Feature activation-0.381
that
Token that
Feature activation-0.248
is
Token is
Feature activation-0.179
true
Token true
Feature activation-0.285
,
Token,
Feature activation-0.008
though
Token though
Feature activation-0.128
,
Token,
Feature activation-0.063
is
Token is
Feature activation+0.007
that
Token that
Feature activation+0.004
whatever
Token whatever
Feature activation-0.126
policy
Token policy
Feature activation-0.426
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
true
Token true
Feature activation-0.401
,
Token,
Feature activation-0.073
though
Token though
Feature activation-0.171
,
Token,
Feature activation-0.098
is
Token is
Feature activation+0.013
that
Token that
Feature activation+0.052
whatever
Token whatever
Feature activation-0.268
policy
Token policy
Feature activation-0.397
prescriptions
Token prescriptions
Feature activation-0.237
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-2.062
perspective
Token perspective
Feature activation-0.031
.
Token.
Feature activation-0.041
"
Token "
Feature activation+0.014
What
TokenWhat
Feature activation-0.068
is
Token is
Feature activation-0.026
true
Token true
Feature activation-0.130
,
Token,
Feature activation-0.006
though
Token though
Feature activation-0.039
is
Token is
Feature activation-0.056
true
Token true
Feature activation-0.124
,
Token,
Feature activation-0.008
though
Token though
Feature activation-0.069
,
Token,
Feature activation-0.057
is
Token is
Feature activation+0.011
that
Token that
Feature activation-0.074
whatever
Token whatever
Feature activation-0.072
policy
Token policy
Feature activation-0.227
prescriptions
Token prescriptions
Feature activation-0.151
that
Token that
Feature activation-0.205
prescriptions
Token prescriptions
Feature activation-0.090
that
Token that
Feature activation-0.139
we
Token we
Feature activation-0.186
've
Token've
Feature activation+0.003
been
Token been
Feature activation+0.011
proposing
Token proposing
Feature activation+0.107
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.113
policy
Token policy
Feature activation-0.025
prescriptions
Token prescriptions
Feature activation-0.075
that
Token that
Feature activation-0.033
we
Token we
Feature activation-0.016
've
Token've
Feature activation+0.008
been
Token been
Feature activation-0.007
proposing
Token proposing
Feature activation-0.017
don
Token don
Feature activation-0.363
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
is
Token is
Feature activation-0.199
true
Token true
Feature activation-0.513
,
Token,
Feature activation-0.021
though
Token though
Feature activation-0.126
,
Token,
Feature activation-0.087
is
Token is
Feature activation+0.071
that
Token that
Feature activation+0.046
whatever
Token whatever
Feature activation-0.267
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.925
perspective
Token perspective
Feature activation-0.087
.
Token.
Feature activation-0.049
"
Token "
Feature activation+0.184
What
TokenWhat
Feature activation-0.214
is
Token is
Feature activation-0.146
true
Token true
Feature activation-0.354
,
Token,
Feature activation-0.071
though
Token though
Feature activation-0.177
is
Token is
Feature activation-0.175
true
Token true
Feature activation-0.309
,
Token,
Feature activation-0.049
though
Token though
Feature activation-0.472
,
Token,
Feature activation-0.195
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.843
perspective
Token perspective
Feature activation-0.089
.
Token.
Feature activation-0.041
"
Token "
Feature activation+0.130
What
TokenWhat
Feature activation-0.235
is
Token is
Feature activation-0.112
true
Token true
Feature activation-0.659
,
Token,
Feature activation-0.019
though
Token though
Feature activation-0.307

Decoder Weights Distribution

Head 0: 0.09

Head 1: 0.09

Head 2: 0.08

Head 3: 0.09

Head 4: 0.07

Head 5: 0.09

Head 6: 0.09

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.07

Head 11: 0.09

Positive logits

MpServer 3.20

CoC 3.09

tort 3.02

erred 2.58

consensual 2.51

enance 2.47

BDS 2.43

POV 2.43

Polic 2.40

STD 2.40

unethical 2.39

Transparency 2.36

External 2.36

Zer 2.36

Tsuk 2.36

Genocide 2.36

Misc 2.35

tortured 2.34

Haku 2.32

untarily 2.31

Negative logits

-3.29

owers -2.88

Â -2.86

­ -2.81

ower -2.76

soon -2.55

prus -2.51

mercial -2.49

Bey -2.41

install -2.38

vin -2.37

·· -2.35

ople -2.34

ogg -2.34

yah -2.30

amina -2.30

-2.29

-2.29

Wonder -2.27

Continent -2.23

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

o
Tokeno
Feature activation+0.000
,
Token,
Feature activation+0.000
would
Token would
Feature activation+0.000
eliminate
Token eliminate
Feature activation+0.000
the
Token the
Feature activation+0.000
for
Token for
Feature activation+0.000
-
Token-
Feature activation+0.000
profit
Tokenprofit
Feature activation+0.000
health
Token health
Feature activation+0.000
insurance
Token insurance
Feature activation+0.000
middle
Token middle
Feature activation+0.000
a
Token a
Feature activation+0.000
legal
Token legal
Feature activation+0.000
claim
Token claim
Feature activation+0.000
against
Token against
Feature activation+0.000
the
Token the
Feature activation+0.000
city
Token city
Feature activation+0.000
of
Token of
Feature activation+0.000
B
Token B
Feature activation+0.000
akers
Tokenakers
Feature activation+0.000
field
Tokenfield
Feature activation+0.000
.
Token.
Feature activation+0.000
our
Token our
Feature activation+0.000
channels
Token channels
Feature activation+0.000
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Be
TokenBe
Feature activation+0.000
avis
Tokenavis
Feature activation+0.000
and
Token and
Feature activation+0.000
Butt
Token Butt
Feature activation+0.000
-
Token-
Feature activation+0.000
head
Tokenhead
Feature activation+0.000
Rudd
Token Rudd
Feature activation+0.000
responded
Token responded
Feature activation+0.000
by
Token by
Feature activation+0.000
condemning
Token condemning
Feature activation+0.000
the
Token the
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
de
Tokende
Feature activation+0.000
pl
Tokenpl
Feature activation+0.000
orable
Tokenorable
Feature activation+0.000
rise
Token rise
Feature activation+0.000
podcast
Token podcast
Feature activation+0.000
,
Token,
Feature activation+0.000
a
Token a
Feature activation+0.000
news
Token news
Feature activation+0.000
report
Token report
Feature activation+0.000
made
Token made
Feature activation+0.000
while
Token while
Feature activation+0.000
none
Token none
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
accused
Token accused
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 25: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 0.951

though
Token though
Feature activation-0.113
,
Token,
Feature activation-0.127
is
Token is
Feature activation-0.151
that
Token that
Feature activation-0.644
whatever
Token whatever
Feature activation-0.507
policy
Token policy
Feature activation+0.001
prescriptions
Token prescriptions
Feature activation-0.173
that
Token that
Feature activation-0.172
we
Token we
Feature activation-0.170
've
Token've
Feature activation-0.004
been
Token been
Feature activation-0.319
proposing
Token proposing
Feature activation-0.338
don
Token don
Feature activation-0.340
't
Token't
Feature activation-0.256
reach
Token reach
Feature activation-0.925
,
Token,
Feature activation-0.705
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation-0.026
that
Token that
Feature activation-0.098
we
Token we
Feature activation-0.292
've
Token've
Feature activation-0.245
been
Token been
Feature activation-0.276
proposing
Token proposing
Feature activation+0.257
don
Token don
Feature activation-1.563
't
Token't
Feature activation-0.495
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
been
Token been
Feature activation-0.175
proposing
Token proposing
Feature activation-0.176
don
Token don
Feature activation-0.254
't
Token't
Feature activation-0.291
reach
Token reach
Feature activation-1.132
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
.
Token.
Feature activation-0.090
"
Token "
Feature activation-0.191
What
TokenWhat
Feature activation-0.797
is
Token is
Feature activation-0.815
true
Token true
Feature activation-0.617
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
perspective
Token perspective
Feature activation-0.211
.
Token.
Feature activation-0.240
"
Token "
Feature activation-0.263
What
TokenWhat
Feature activation-0.721
is
Token is
Feature activation-0.616
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.341
perspective
Token perspective
Feature activation-0.332
.
Token.
Feature activation+0.002
"
Token "
Feature activation+0.216
What
TokenWhat
Feature activation-0.983
is
Token is
Feature activation-0.569
true
Token true
Feature activation-1.259
,
Token,
Feature activation-0.302
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.084
perspective
Token perspective
Feature activation-0.165
.
Token.
Feature activation+0.951
"
Token "
Feature activation-0.298
What
TokenWhat
Feature activation-0.812
is
Token is
Feature activation-0.224
true
Token true
Feature activation-0.278
,
Token,
Feature activation+0.108
,
Token,
Feature activation-0.159
is
Token is
Feature activation-0.249
that
Token that
Feature activation-0.851
whatever
Token whatever
Feature activation-0.412
policy
Token policy
Feature activation-0.196
prescriptions
Token prescriptions
Feature activation+0.468
that
Token that
Feature activation-0.391
we
Token we
Feature activation-0.523
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
that
Token that
Feature activation-0.566
whatever
Token whatever
Feature activation-0.525
policy
Token policy
Feature activation-0.033
prescriptions
Token prescriptions
Feature activation-0.239
that
Token that
Feature activation-0.061
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
,
Token,
Feature activation-0.153
is
Token is
Feature activation-0.250
that
Token that
Feature activation-0.642
whatever
Token whatever
Feature activation-0.444
policy
Token policy
Feature activation-0.236
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
though
Token though
Feature activation-0.496
,
Token,
Feature activation-0.229
is
Token is
Feature activation-0.296
that
Token that
Feature activation-0.535
whatever
Token whatever
Feature activation-0.818
policy
Token policy
Feature activation+0.159
prescriptions
Token prescriptions
Feature activation-0.153
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation-0.157
is
Token is
Feature activation-0.109
that
Token that
Feature activation-0.577
whatever
Token whatever
Feature activation-0.462
policy
Token policy
Feature activation-0.161
prescriptions
Token prescriptions
Feature activation+0.469
that
Token that
Feature activation-0.350
we
Token we
Feature activation-0.536
've
Token've
Feature activation-0.344
been
Token been
Feature activation-0.846
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.156
is
Token is
Feature activation-0.192
that
Token that
Feature activation-0.848
whatever
Token whatever
Feature activation-0.503
policy
Token policy
Feature activation-0.127
prescriptions
Token prescriptions
Feature activation+0.556
that
Token that
Feature activation-0.492
we
Token we
Feature activation-0.449
've
Token've
Feature activation-0.626
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.242
that
Token that
Feature activation-0.128
we
Token we
Feature activation-0.187
've
Token've
Feature activation-0.172
been
Token been
Feature activation-0.257
proposing
Token proposing
Feature activation+0.425
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
whatever
Token whatever
Feature activation-0.221
policy
Token policy
Feature activation-0.073
prescriptions
Token prescriptions
Feature activation-0.282
that
Token that
Feature activation-0.058
we
Token we
Feature activation-0.019
've
Token've
Feature activation+0.057
been
Token been
Feature activation-0.045
proposing
Token proposing
Feature activation-0.048
don
Token don
Feature activation-10.578
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
though
Token though
Feature activation-0.316
,
Token,
Feature activation-0.163
is
Token is
Feature activation-0.377
that
Token that
Feature activation-0.240
whatever
Token whatever
Feature activation-0.537
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.889
perspective
Token perspective
Feature activation-0.010
.
Token.
Feature activation+0.024
"
Token "
Feature activation+0.037
What
TokenWhat
Feature activation-0.385
is
Token is
Feature activation-0.221
true
Token true
Feature activation-0.546
,
Token,
Feature activation-0.240
though
Token though
Feature activation-0.505
<|endoftext|>
Token<|endoftext|>
Feature activation-6.663
perspective
Token perspective
Feature activation+0.109
.
Token.
Feature activation+0.108
"
Token "
Feature activation-0.074
What
TokenWhat
Feature activation-0.880
is
Token is
Feature activation-0.517
true
Token true
Feature activation-0.360
<|endoftext|>
Token<|endoftext|>
Feature activation-6.719
perspective
Token perspective
Feature activation-0.022
.
Token.
Feature activation+0.301
"
Token "
Feature activation+0.048
What
TokenWhat
Feature activation-0.519
is
Token is
Feature activation-0.343
true
Token true
Feature activation-1.191
,
Token,
Feature activation-0.159

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.07

Head 2: 0.08

Head 3: 0.08

Head 4: 0.08

Head 5: 0.09

Head 6: 0.08

Head 7: 0.09

Head 8: 0.09

Head 9: 0.09

Head 10: 0.09

Head 11: 0.09

Positive logits

Pradesh 3.18

Dialogue 3.13

Mulcair 3.10

Sabha 3.03

Topics 2.91

uckland 2.81

Tik 2.79

Labour 2.74

ropolis 2.73

jriwal 2.67

ontent 2.67

utan 2.65

Trudeau 2.65

ameron 2.65

(token not rendered) 2.63

Vaj 2.60

aucus 2.60

inar 2.59

ILCS 2.58

#$#$ 2.57

Negative logits

psy -2.86

Cra -2.80

millenn -2.75

Da -2.75

gen -2.59

FA -2.53

iversary -2.49

Cra -2.48

Fen -2.47

NING -2.40

aiman -2.37

Innocent -2.35

Carnival -2.32

Fel -2.31

mys -2.31

opio -2.30

worms -2.29

Fell -2.29

Cosmic -2.25

Fool -2.24

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

the
Token the
Feature activation+0.000
park
Token park
Feature activation+0.000
had
Token had
Feature activation+0.000
not
Token not
Feature activation+0.000
been
Token been
Feature activation+0.000
made
Token made
Feature activation+0.000
lightly
Token lightly
Feature activation+0.000
but
Token but
Feature activation+0.000
the
Token the
Feature activation+0.000
welfare
Token welfare
Feature activation+0.000
of
Token of
Feature activation+0.000
the
Token the
Feature activation+0.000
strongest
Token strongest
Feature activation+0.000
possible
Token possible
Feature activation+0.000
language
Token language
Feature activation+0.000
threats
Token threats
Feature activation+0.000
of
Token of
Feature activation+0.000
violence
Token violence
Feature activation+0.000
targeting
Token targeting
Feature activation+0.000
feminists
Token feminists
Feature activation+0.000
and
Token and
Feature activation+0.000
women
Token women
Feature activation+0.000
social
Token social
Feature activation+0.000
classes
Token classes
Feature activation+0.000
prior
Token prior
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
French
Token French
Feature activation+0.000
Revolution
Token Revolution
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
B
TokenB
Feature activation+0.000
ane
Tokenane
Feature activation+0.000
justice
Token justice
Feature activation+0.000
."
Token."
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Core
TokenCore
Feature activation+0.000
y
Tokeny
Feature activation+0.000
's
Token's
Feature activation+0.000
office
Token office
Feature activation+0.000
argued
Token argued
Feature activation+0.000
that
Token that
Feature activation+0.000
Alexander
Token Alexander
Feature activation+0.000
ization
Tokenization
Feature activation+0.000
payment
Token payment
Feature activation+0.000
for
Token for
Feature activation+0.000
over
Token over
Feature activation+0.000
50
Token 50
Feature activation+0.000
years
Token years
Feature activation+0.000
and
Token and
Feature activation+0.000
has
Token has
Feature activation+0.000
done
Token done
Feature activation+0.000
more
Token more
Feature activation+0.000
than
Token than
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 26: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 2.043

,
Token,
Feature activation-0.104
is
Token is
Feature activation+0.024
that
Token that
Feature activation-0.398
whatever
Token whatever
Feature activation-0.591
policy
Token policy
Feature activation+0.078
prescriptions
Token prescriptions
Feature activation+0.264
that
Token that
Feature activation-0.260
we
Token we
Feature activation-0.121
've
Token've
Feature activation+0.042
been
Token been
Feature activation-0.182
proposing
Token proposing
Feature activation-0.110
.
Token.
Feature activation-0.018
"
Token "
Feature activation-0.013
What
TokenWhat
Feature activation-0.476
is
Token is
Feature activation-0.249
true
Token true
Feature activation-0.512
,
Token,
Feature activation+0.036
though
Token though
Feature activation-0.414
,
Token,
Feature activation-0.056
is
Token is
Feature activation-0.136
that
Token that
Feature activation-0.262
whatever
Token whatever
Feature activation-0.618
prescriptions
Token prescriptions
Feature activation+0.110
that
Token that
Feature activation-0.115
we
Token we
Feature activation-0.057
've
Token've
Feature activation+0.015
been
Token been
Feature activation-0.082
proposing
Token proposing
Feature activation+0.404
don
Token don
Feature activation-0.777
't
Token't
Feature activation-0.466
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.773
,
Token,
Feature activation-0.197
though
Token though
Feature activation-0.752
,
Token,
Feature activation-0.139
is
Token is
Feature activation+0.046
that
Token that
Feature activation+0.313
whatever
Token whatever
Feature activation-0.504
policy
Token policy
Feature activation-0.525
prescriptions
Token prescriptions
Feature activation-0.192
that
Token that
Feature activation-0.142
we
Token we
Feature activation-0.038
.
Token.
Feature activation-0.401
"
Token "
Feature activation-0.335
What
TokenWhat
Feature activation-0.625
is
Token is
Feature activation-0.406
true
Token true
Feature activation-0.724
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.553
perspective
Token perspective
Feature activation-0.280
.
Token.
Feature activation-0.282
"
Token "
Feature activation+0.023
What
TokenWhat
Feature activation-0.507
is
Token is
Feature activation-0.635
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.278
perspective
Token perspective
Feature activation-0.491
.
Token.
Feature activation+0.031
"
Token "
Feature activation+0.754
What
TokenWhat
Feature activation-0.865
is
Token is
Feature activation-0.663
true
Token true
Feature activation-1.304
,
Token,
Feature activation-0.343
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.182
.
Token.
Feature activation-0.366
"
Token "
Feature activation-3.042
What
TokenWhat
Feature activation-0.701
is
Token is
Feature activation-0.316
true
Token true
Feature activation+0.644
,
Token,
Feature activation-0.257
though
Token though
Feature activation-3.110
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.077
is
Token is
Feature activation-0.070
that
Token that
Feature activation-0.560
whatever
Token whatever
Feature activation-0.354
policy
Token policy
Feature activation-0.049
prescriptions
Token prescriptions
Feature activation+1.850
that
Token that
Feature activation-0.523
we
Token we
Feature activation-0.419
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
though
Token though
Feature activation-0.594
,
Token,
Feature activation-0.153
is
Token is
Feature activation+0.081
that
Token that
Feature activation-0.183
whatever
Token whatever
Feature activation-0.518
policy
Token policy
Feature activation+0.128
prescriptions
Token prescriptions
Feature activation-0.032
that
Token that
Feature activation-0.243
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
though
Token though
Feature activation-0.601
,
Token,
Feature activation-0.169
is
Token is
Feature activation-0.020
that
Token that
Feature activation-0.122
whatever
Token whatever
Feature activation-0.398
policy
Token policy
Feature activation+0.115
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
though
Token though
Feature activation-0.829
,
Token,
Feature activation-0.252
is
Token is
Feature activation-0.008
that
Token that
Feature activation+0.130
whatever
Token whatever
Feature activation-0.791
policy
Token policy
Feature activation+0.332
prescriptions
Token prescriptions
Feature activation+0.254
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation-0.117
is
Token is
Feature activation+0.053
that
Token that
Feature activation-0.398
whatever
Token whatever
Feature activation-0.397
policy
Token policy
Feature activation+0.038
prescriptions
Token prescriptions
Feature activation+1.845
that
Token that
Feature activation-0.387
we
Token we
Feature activation-0.254
've
Token've
Feature activation-0.079
been
Token been
Feature activation-0.583
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.153
is
Token is
Feature activation-0.032
that
Token that
Feature activation-0.561
whatever
Token whatever
Feature activation-0.437
policy
Token policy
Feature activation+0.083
prescriptions
Token prescriptions
Feature activation+2.043
that
Token that
Feature activation-0.587
we
Token we
Feature activation-0.348
've
Token've
Feature activation-0.472
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.145
is
Token is
Feature activation-0.117
that
Token that
Feature activation-0.856
whatever
Token whatever
Feature activation-0.659
policy
Token policy
Feature activation+0.517
prescriptions
Token prescriptions
Feature activation+0.851
that
Token that
Feature activation-0.279
we
Token we
Feature activation-0.150
've
Token've
Feature activation+0.016
been
Token been
Feature activation-0.095
proposing
Token proposing
Feature activation+0.639
whatever
Token whatever
Feature activation-0.240
policy
Token policy
Feature activation-0.079
prescriptions
Token prescriptions
Feature activation-0.262
that
Token that
Feature activation-0.068
we
Token we
Feature activation-0.020
've
Token've
Feature activation+0.023
been
Token been
Feature activation-0.026
proposing
Token proposing
Feature activation-0.035
don
Token don
Feature activation-5.714
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
true
Token true
Feature activation-0.884
,
Token,
Feature activation-0.137
though
Token though
Feature activation-0.558
,
Token,
Feature activation-0.194
is
Token is
Feature activation+0.140
that
Token that
Feature activation+0.201
whatever
Token whatever
Feature activation-0.641
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
is
Token is
Feature activation-0.273
true
Token true
Feature activation-0.567
,
Token,
Feature activation-0.311
though
Token though
Feature activation-0.866
,
Token,
Feature activation-0.227
is
Token is
Feature activation+0.294
that
Token that
Feature activation+0.173
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
perspective
Token perspective
Feature activation-0.269
.
Token.
Feature activation-0.141
"
Token "
Feature activation-0.357
What
TokenWhat
Feature activation-0.862
is
Token is
Feature activation-0.416
true
Token true
Feature activation+0.424
,
Token,
Feature activation-0.196
though
Token though
Feature activation-2.220
,
Token,
Feature activation-0.596
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
true
Token true
Feature activation-1.161
,
Token,
Feature activation-0.119
though
Token though
Feature activation-1.459
,
Token,
Feature activation-0.372
is
Token is
Feature activation-0.144
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000

Decoder Weights Distribution

Head 0: 0.10

Head 1: 0.07

Head 2: 0.09

Head 3: 0.08

Head 4: 0.06

Head 5: 0.09

Head 6: 0.10

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.09

Head 11: 0.07

Positive logits

Luxem 3.49

stellar 3.31

Lans 3.23

uru 3.10

Mountain 3.05

Sutton 3.01

Yon 3.00

Picard 3.00

grad 2.97

Laurel 2.96

Rider 2.89

Riders 2.76

Rwanda 2.76

HK 2.71

Kitt 2.65

Luxembourg 2.63

Hazel 2.63

Ship 2.62

Hull 2.57

cession 2.57

Negative logits

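(token not rendered) -2.89

Play -2.89

(token not rendered) -2.85

(token not rendered) -2.74

Xbox -2.64

],[ -2.63

Ryan -2.62

CLIENT -2.59

pse -2.56

trump -2.54

perf -2.54

dips -2.53

Decoder -2.52

predec -2.52

2048 -2.48

(token not rendered) -2.48

(token not rendered) -2.47

epis -2.47

Topic -2.45

epid -2.45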

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

their
Token their
Feature activation+0.000
support
Token support
Feature activation+0.000
.
Token.
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
There
TokenThere
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
ize
Tokenize
Feature activation+0.000
the
Token the
Feature activation+0.000
incidents
Token incidents
Feature activation+0.000
for
Token for
Feature activation+0.000
fear
Token fear
Feature activation+0.000
that
Token that
Feature activation+0.000
it
Token it
Feature activation+0.000
could
Token could
Feature activation+0.000
provoke
Token provoke
Feature activation+0.000
a
Token a
Feature activation+0.000
xen
Token xen
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
these
Token these
Feature activation+0.000
utter
Token utter
Feature activation+0.000
ances
Tokenances
Feature activation+0.000
of
Token of
Feature activation+0.000
sacred
Token sacred
Feature activation+0.000
desperation
Token desperation
Feature activation+0.000
signal
Token signal
Feature activation+0.000
that
Token that
Feature activation+0.000
Un
Token Un
Feature activation+0.000
has
Token has
Feature activation+0.000
encouraged
Token encouraged
Feature activation+0.000
them
Token them
Feature activation+0.000
to
Token to
Feature activation+0.000
skip
Token skip
Feature activation+0.000
the
Token the
Feature activation+0.000
convention
Token convention
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
He
TokenHe
Feature activation+0.000
's
Token's
Feature activation+0.000
legal
Token legal
Feature activation+0.000
principle
Token principle
Feature activation+0.000
on
Token on
Feature activation+0.000
its
Token its
Feature activation+0.000
head
Token head
Feature activation+0.000
.
Token.
Feature activation+0.000
After
Token After
Feature activation+0.000
all
Token all
Feature activation+0.000
,
Token,
Feature activation+0.000
there
Token there
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 27: Uninterpretable

TOP ACTIVATIONS
MAX = 4.255

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Tall
Token Tall
Feature activation+1.459
ah
Tokenah
Feature activation+0.000
as
Tokenas
Feature activation+0.000
see
Tokensee
Feature activation+3.564
and
Token and
Feature activation+1.592
her
Token her
Feature activation+0.678
colleagues
Token colleagues
Feature activation+0.668
have
Token have
Feature activation+0.000
obtained
Token obtained
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Su
Token Su
Feature activation+1.485
u
Tokenu
Feature activation+0.809
Ky
Token Ky
Feature activation+0.000
i
Tokeni
Feature activation+3.073
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.664
s
Tokens
Feature activation+1.190
big
Token big
Feature activation+0.264
-
Token-
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Washington
Token Washington
Feature activation+3.276
D
Token D
Feature activation+0.000
.
Token.
Feature activation+0.000
C
TokenC
Feature activation+2.549
.
Token.
Feature activation+1.889
seem
Token seem
Feature activation+1.256
to
Token to
Feature activation+0.406
be
Token be
Feature activation+0.085
paying
Token paying
Feature activation+0.548
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Washington
Token Washington
Feature activation+3.276
D
Token D
Feature activation+0.000
.
Token.
Feature activation+0.000
C
TokenC
Feature activation+2.549
.,
Token.,
Feature activation+1.270
Boston
Token Boston
Feature activation+1.278
P
Token P
Feature activation+0.810
ops
Tokenops
Feature activation+0.959
Fire
Token Fire
Feature activation+0.383
in
Token in
Feature activation+3.798
lame
Token lame
Feature activation+2.834
-
Token-
Feature activation+0.695
du
Tokendu
Feature activation+0.223
ck
Tokenck
Feature activation+2.332
territory
Token territory
Feature activation+2.536
and
Token and
Feature activation+1.123
his
Token his
Feature activation+0.391
GOP
Token GOP
Feature activation+0.000
opponents
Token opponents
Feature activation+0.000
in
Token in
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
slightly
Token slightly
Feature activation+2.603
sm
Token sm
Feature activation+1.099
elly
Tokenelly
Feature activation+2.466
vegetable
Token vegetable
Feature activation+1.215
since
Token since
Feature activation+0.803
Mr
Token Mr
Feature activation+0.510
Abbott
Token Abbott
Feature activation+0.206
's
Token's
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Wil
Token Wil
Feature activation+1.709
hel
Tokenhel
Feature activation+0.000
min
Tokenmin
Feature activation+0.000
a
Tokena
Feature activation+2.441
until
Token until
Feature activation+0.000
you
Token you
Feature activation+0.000
get
Token get
Feature activation+0.000
to
Token to
Feature activation+0.000
the
Token the
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
U
Token U
Feature activation+1.984
.
Token.
Feature activation+1.057
S
TokenS
Feature activation+2.399
.
Token.
Feature activation+1.953
from
Token from
Feature activation+1.137
2007
Token 2007
Feature activation+0.582
to
Token to
Feature activation+0.143
2015
Token 2015
Feature activation+0.806
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
U
Token U
Feature activation+1.984
.
Token.
Feature activation+1.057
S
TokenS
Feature activation+2.399
.
Token.
Feature activation+1.953
House
Token House
Feature activation+1.479
of
Token of
Feature activation+0.036
Representatives
Token Representatives
Feature activation+0.596
will
Token will
Feature activation+0.291
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
-
Token-
Feature activation+2.548
game
Tokengame
Feature activation+2.840
in
Token in
Feature activation+1.966
a
Token a
Feature activation+2.393
small
Token small
Feature activation+1.787
number
Token number
Feature activation+1.593
of
Token of
Feature activation+1.213
situations
Token situations
Feature activation+0.797
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
less
Token less
Feature activation+2.804
than
Token than
Feature activation+0.058
five
Token five
Feature activation+0.641
years
Token years
Feature activation+2.390
many
Token many
Feature activation+0.901
less
Token less
Feature activation+0.000
-
Token-
Feature activation+0.000
dem
Tokendem
Feature activation+0.000
anding
Tokenanding
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
2020
Token 2020
Feature activation+1.596
general
Token general
Feature activation+0.575
election
Token election
Feature activation+2.358
,
Token,
Feature activation+0.206
a
Token a
Feature activation+0.199
commission
Token commission
Feature activation+0.000
set
Token set
Feature activation+0.000
up
Token up
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
lame
Token lame
Feature activation+2.834
-
Token-
Feature activation+0.695
du
Tokendu
Feature activation+0.223
ck
Tokenck
Feature activation+2.332
territory
Token territory
Feature activation+2.536
and
Token and
Feature activation+1.123
his
Token his
Feature activation+0.391
GOP
Token GOP
Feature activation+0.000
opponents
Token opponents
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
time
Token time
Feature activation+3.275
and
Token and
Feature activation+1.357
in
Token in
Feature activation+0.426
quality
Token quality
Feature activation+2.316
of
Token of
Feature activation+0.214
life
Token life
Feature activation+1.660
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
both
Token both
Feature activation+3.661
power
Token power
Feature activation+1.505
and
Token and
Feature activation+1.318
deed
Token deed
Feature activation+2.300
.
Token.
Feature activation+0.498
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
tablets
Token tablets
Feature activation+3.529
as
Token as
Feature activation+2.395
in
Token in
Feature activation+0.956
smartphones
Token smartphones
Feature activation+2.279
,
Token,
Feature activation+0.055
they
Token they
Feature activation+0.125
have
Token have
Feature activation+0.108
instead
Token instead
Feature activation+0.001
become
Token become
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
House
Token House
Feature activation+3.055
and
Token and
Feature activation+1.667
Senate
Token Senate
Feature activation+2.268
sounded
Token sounded
Feature activation+1.003
different
Token different
Feature activation+0.422
notes
Token notes
Feature activation+0.222
on
Token on
Feature activation+0.258
the
Token the
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
paper
Token paper
Feature activation+3.124
in
Token in
Feature activation+3.386
addition
Token addition
Feature activation+2.360
to
Token to
Feature activation+2.245
the
Token the
Feature activation+2.229
electronic
Token electronic
Feature activation+1.648
version
Token version
Feature activation+1.083
.
Token.
Feature activation+0.895
Ċ
TokenĊ
Feature activation+0.196
in
Token in
Feature activation+3.798
paper
Token paper
Feature activation+3.124
in
Token in
Feature activation+3.386
addition
Token addition
Feature activation+2.360
to
Token to
Feature activation+2.245
the
Token the
Feature activation+2.229
electronic
Token electronic
Feature activation+1.648
version
Token version
Feature activation+1.083
.
Token.
Feature activation+0.895
Ċ
TokenĊ
Feature activation+0.196
Ċ
TokenĊ
Feature activation+1.098
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
pian
Token pian
Feature activation+1.999
of
Tokenof
Feature activation+0.000
ort
Tokenort
Feature activation+0.461
e
Tokene
Feature activation+2.226
.[
Token.[
Feature activation+0.328
8
Token8
Feature activation+0.342
]
Token]
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000

Top DFA by src position
MAX = 4.053

<|endoftext|>
Token<|endoftext|>
Feature activation-1.480
in
Token in
Feature activation+3.840
Tall
Token Tall
Feature activation-0.061
ah
Tokenah
Feature activation-0.017
as
Tokenas
Feature activation+0.072
see
Tokensee
Feature activation+0.918
and
Token and
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.523
in
Token in
Feature activation+4.053
Su
Token Su
Feature activation+0.214
u
Tokenu
Feature activation-0.121
Ky
Token Ky
Feature activation+0.043
i
Tokeni
Feature activation+0.115
âĢ
TokenâĢ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.578
in
Token in
Feature activation+2.713
Washington
Token Washington
Feature activation+0.735
D
Token D
Feature activation+0.111
.
Token.
Feature activation-0.174
C
TokenC
Feature activation+0.447
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.578
in
Token in
Feature activation+2.713
Washington
Token Washington
Feature activation+0.735
D
Token D
Feature activation+0.111
.
Token.
Feature activation-0.174
C
TokenC
Feature activation+0.447
.,
Token.,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.507
in
Token in
Feature activation+2.721
lame
Token lame
Feature activation+0.034
-
Token-
Feature activation-0.058
du
Tokendu
Feature activation+0.063
ck
Tokenck
Feature activation+0.631
territory
Token territory
Feature activation+0.360
<|endoftext|>
Token<|endoftext|>
Feature activation-1.676
in
Token in
Feature activation+2.922
the
Token the
Feature activation+0.460
slightly
Token slightly
Feature activation+0.259
sm
Token sm
Feature activation-0.012
elly
Tokenelly
Feature activation+0.220
vegetable
Token vegetable
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.436
in
Token in
Feature activation+3.911
Wil
Token Wil
Feature activation-0.200
hel
Tokenhel
Feature activation-0.111
min
Tokenmin
Feature activation-0.305
a
Tokena
Feature activation+0.289
until
Token until
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.587
in
Token in
Feature activation+3.475
the
Token the
Feature activation+0.169
U
Token U
Feature activation+0.137
.
Token.
Feature activation-0.220
S
TokenS
Feature activation+0.132
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.587
in
Token in
Feature activation+3.475
the
Token the
Feature activation+0.169
U
Token U
Feature activation+0.137
.
Token.
Feature activation-0.220
S
TokenS
Feature activation+0.132
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.550
in
Token in
Feature activation+2.231
-
Token-
Feature activation+0.126
game
Tokengame
Feature activation+0.528
in
Token in
Feature activation+0.645
a
Token a
Feature activation+0.119
small
Token small
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.542
in
Token in
Feature activation+3.438
less
Token less
Feature activation-0.011
than
Token than
Feature activation-0.104
five
Token five
Feature activation-0.086
years
Token years
Feature activation+0.402
many
Token many
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.477
in
Token in
Feature activation+2.766
the
Token the
Feature activation+0.035
2020
Token 2020
Feature activation+0.399
general
Token general
Feature activation+0.033
election
Token election
Feature activation+0.309
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.619
in
Token in
Feature activation+3.051
lame
Token lame
Feature activation+0.338
-
Token-
Feature activation-0.109
du
Tokendu
Feature activation+0.049
ck
Tokenck
Feature activation+0.329
territory
Token territory
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.503
in
Token in
Feature activation+3.445
time
Token time
Feature activation+0.226
and
Token and
Feature activation-0.128
in
Token in
Feature activation+0.010
quality
Token quality
Feature activation-0.027
of
Token of
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.511
in
Token in
Feature activation+3.318
both
Token both
Feature activation+0.195
power
Token power
Feature activation-0.236
and
Token and
Feature activation-0.047
deed
Token deed
Feature activation+0.286
.
Token.
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.163
in
Token in
Feature activation+2.197
tablets
Token tablets
Feature activation+1.143
as
Token as
Feature activation-0.656
in
Token in
Feature activation-0.045
smartphones
Token smartphones
Feature activation+0.510
,
Token,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.475
in
Token in
Feature activation+2.183
the
Token the
Feature activation-0.010
House
Token House
Feature activation+0.970
and
Token and
Feature activation-0.114
Senate
Token Senate
Feature activation+0.420
sounded
Token sounded
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.492
in
Token in
Feature activation+1.930
paper
Token paper
Feature activation+0.439
in
Token in
Feature activation+0.875
addition
Token addition
Feature activation+0.093
to
Token to
Feature activation+0.107
the
Token the
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-1.428
in
Token in
Feature activation+1.772
paper
Token paper
Feature activation+0.171
in
Token in
Feature activation+1.034
addition
Token addition
Feature activation+0.039
to
Token to
Feature activation+0.172
the
Token the
Feature activation+0.175
<|endoftext|>
Token<|endoftext|>
Feature activation-1.503
in
Token in
Feature activation+3.440
pian
Token pian
Feature activation+0.019
of
Tokenof
Feature activation-0.026
ort
Tokenort
Feature activation-0.139
e
Tokene
Feature activation+0.143
.[
Token.[
Feature activation+0.000

Decoder Weights Distribution

Head 0: 0.19

Head 1: 0.07

Head 2: 0.05

Head 3: 0.06

Head 4: 0.09

Head 5: 0.04

Head 6: 0.04

Head 7: 0.06

Head 8: 0.27

Head 9: 0.03

Head 10: 0.05

Head 11: 0.06

Positive logits

senal 2.55

Leban 2.48

carbohyd 2.34

PDATE 2.09

Moroc 2.07

Nanto 2.06

eleph 1.97

sembly 1.96

iven 1.86

ortunately 1.86

StreamerBot 1.84

ioxide 1.81

antidepress 1.77

hovah 1.76

subur 1.74

pione 1.72

�� 1.71

��士 1.69

emetery 1.68

inventoryQuantity 1.68

Negative logits

-2.00

-1.74

-1.73

-1.67

-1.60

-1.54

-1.53

Chancellor -1.52

--- -1.49

inadvertently -1.48

disl -1.46

coinc -1.43

-1.38

pic -1.38

unknow -1.35

-- -1.35

-1.34

det -1.33

Temp -1.31

Das -1.30

INTERVAL 3.830 - 4.255
CONTAINS 0.000%

INTERVAL 3.404 - 3.830
CONTAINS 0.013%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Tall
Token Tall
Feature activation+1.459
ah
Tokenah
Feature activation+0.000
as
Tokenas
Feature activation+0.000
see
Tokensee
Feature activation+3.564
and
Token and
Feature activation+1.592
her
Token her
Feature activation+0.678
colleagues
Token colleagues
Feature activation+0.668
have
Token have
Feature activation+0.000
obtained
Token obtained
Feature activation+0.000

INTERVAL 2.979 - 3.404
CONTAINS 0.002%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Su
Token Su
Feature activation+1.485
u
Tokenu
Feature activation+0.809
Ky
Token Ky
Feature activation+0.000
i
Tokeni
Feature activation+3.073
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.664
s
Tokens
Feature activation+1.190
big
Token big
Feature activation+0.264
-
Token-
Feature activation+0.000

INTERVAL 2.553 - 2.979
CONTAINS 0.004%

INTERVAL 2.128 - 2.553
CONTAINS 0.004%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
time
Token time
Feature activation+3.275
and
Token and
Feature activation+1.357
in
Token in
Feature activation+0.426
quality
Token quality
Feature activation+2.316
of
Token of
Feature activation+0.214
life
Token life
Feature activation+1.660
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
slightly
Token slightly
Feature activation+2.603
sm
Token sm
Feature activation+1.099
elly
Tokenelly
Feature activation+2.466
vegetable
Token vegetable
Feature activation+1.215
since
Token since
Feature activation+0.803
Mr
Token Mr
Feature activation+0.510
Abbott
Token Abbott
Feature activation+0.206
's
Token's
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
House
Token House
Feature activation+3.055
and
Token and
Feature activation+1.667
Senate
Token Senate
Feature activation+2.268
sounded
Token sounded
Feature activation+1.003
different
Token different
Feature activation+0.422
notes
Token notes
Feature activation+0.222
on
Token on
Feature activation+0.258
the
Token the
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
lame
Token lame
Feature activation+2.834
-
Token-
Feature activation+0.695
du
Tokendu
Feature activation+0.223
ck
Tokenck
Feature activation+2.332
territory
Token territory
Feature activation+2.536
and
Token and
Feature activation+1.123
his
Token his
Feature activation+0.391
GOP
Token GOP
Feature activation+0.000
opponents
Token opponents
Feature activation+0.000
in
Token in
Feature activation+3.798
paper
Token paper
Feature activation+3.124
in
Token in
Feature activation+3.386
addition
Token addition
Feature activation+2.360
to
Token to
Feature activation+2.245
the
Token the
Feature activation+2.229
electronic
Token electronic
Feature activation+1.648
version
Token version
Feature activation+1.083
.
Token.
Feature activation+0.895
Ċ
TokenĊ
Feature activation+0.196
Ċ
TokenĊ
Feature activation+1.098

INTERVAL 1.702 - 2.128
CONTAINS 0.004%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
getting
Token getting
Feature activation+2.137
into
Token into
Feature activation+1.543
bar
Token bar
Feature activation+1.565
fights
Token fights
Feature activation+1.936
than
Token than
Feature activation+0.000
he
Token he
Feature activation+0.000
was
Token was
Feature activation+0.000
in
Token in
Feature activation+0.000
further
Token further
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
a
Token a
Feature activation+2.819
drugs
Token drugs
Feature activation+1.442
bust
Token bust
Feature activation+2.761
in
Token in
Feature activation+1.822
Mexico
Token Mexico
Feature activation+1.755
City
Token City
Feature activation+1.383
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
In
TokenIn
Feature activation+0.329
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
either
Token either
Feature activation+3.324
science
Token science
Feature activation+1.766
or
Token or
Feature activation+0.331
agriculture
Token agriculture
Feature activation+1.887
,
Token,
Feature activation+0.182
he
Token he
Feature activation+0.455
previously
Token previously
Feature activation+0.000
suggested
Token suggested
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
next
Token next
Feature activation+2.643
50
Token 50
Feature activation+1.825
years
Token years
Feature activation+1.735
when
Token when
Feature activation+0.187
demand
Token demand
Feature activation+0.000
for
Token for
Feature activation+0.000
food
Token food
Feature activation+0.000
is
Token is
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
trouble
Token trouble
Feature activation+4.033
with
Token with
Feature activation+1.901
the
Token the
Feature activation+0.708
law
Token law
Feature activation+1.917
,
Token,
Feature activation+0.161
visit
Token visit
Feature activation+0.214
http
Token http
Feature activation+0.000
://
Token://
Feature activation+0.000
www
Tokenwww
Feature activation+0.000

INTERVAL 1.277 - 1.702
CONTAINS 0.006%

<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Computer
Token Computer
Feature activation+1.680
Science
Token Science
Feature activation+2.424
from
Token from
Feature activation+1.774
the
Token the
Feature activation+1.538
University
Token University
Feature activation+0.576
of
Token of
Feature activation+0.202
Oxford
Token Oxford
Feature activation+0.447
.
Token.
Feature activation+0.239
He
Token He
Feature activation+0.382
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
Washington
Token Washington
Feature activation+3.276
County
Token County
Feature activation+2.613
Ċ
TokenĊ
Feature activation+0.170
Ċ
TokenĊ
Feature activation+1.628
On
TokenOn
Feature activation+0.319
Friday
Token Friday
Feature activation+0.000
,
Token,
Feature activation+0.000
Hills
Token Hills
Feature activation+0.000
boro
Tokenboro
Feature activation+0.000
Bor
Token Bor
Feature activation+0.000
ow
Tokenow
Feature activation+0.000
ie
Tokenie
Feature activation+0.000
cki
Tokencki
Feature activation+0.000
in
Token in
Feature activation+0.884
the
Token the
Feature activation+1.354
fifth
Token fifth
Feature activation+0.610
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
2009
Token2009
Feature activation+0.000
in
Token in
Feature activation+3.798
the
Token the
Feature activation+3.526
Belg
Token Belg
Feature activation+0.808
rav
Tokenrav
Feature activation+0.056
ia
Tokenia
Feature activation+2.168
district
Token district
Feature activation+1.492
of
Token of
Feature activation+0.000
London
Token London
Feature activation+0.380
,
Token,
Feature activation+0.000
Lee
Token Lee
Feature activation+0.000
began
Token began
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
their
Token their
Feature activation+3.017
own
Token own
Feature activation+1.468
section
Token section
Feature activation+3.070
,
Token,
Feature activation+1.290
called
Token called
Feature activation+2.196
P
Token P
Feature activation+0.978
UL
TokenUL
Feature activation+0.000
-
Token-
Feature activation+0.260
L
TokenL
Feature activation+0.000

INTERVAL 0.851 - 1.277
CONTAINS 0.012%

ebra
Tokenebra
Feature activation+1.350
fish
Tokenfish
Feature activation+1.768
and
Token and
Feature activation+1.160
CHO
Token CHO
Feature activation+0.355
cells
Token cells
Feature activation+1.219
(
Token (
Feature activation+0.949
A
TokenA
Feature activation+0.409
uer
Tokenuer
Feature activation+0.112
et
Token et
Feature activation+0.000
al
Token al
Feature activation+0.322
.,
Token.,
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
a
Token a
Feature activation+2.819
number
Token number
Feature activation+0.000
of
Token of
Feature activation+1.887
ways
Token ways
Feature activation+1.049
on
Token on
Feature activation+0.650
foreign
Token foreign
Feature activation+0.305
policy
Token policy
Feature activation+0.000
,
Token,
Feature activation+0.000
especially
Token especially
Feature activation+0.000
in
Token in
Feature activation+3.798
Z
Token Z
Feature activation+2.313
ab
Tokenab
Feature activation+0.742
r
Tokenr
Feature activation+0.411
ze
Tokenze
Feature activation+0.202
,
Token,
Feature activation+1.162
Medical
Token Medical
Feature activation+0.609
University
Token University
Feature activation+0.999
of
Token of
Feature activation+0.061
S
Token S
Feature activation+0.000
iles
Tokeniles
Feature activation+0.000
from
Token from
Feature activation+0.000
Sp
Token Sp
Feature activation+0.000
ry
Tokenry
Feature activation+0.000
field
Tokenfield
Feature activation+0.000
âĢĶ
Token âĢĶ
Feature activation+0.000
in
Token in
Feature activation+0.857
a
Token a
Feature activation+0.560
burned
Token burned
Feature activation+0.411
cabin
Token cabin
Feature activation+0.000
off
Token off
Feature activation+0.000
Highway
Token Highway
Feature activation+0.000
the
Token the
Feature activation+3.526
U
Token U
Feature activation+1.984
.
Token.
Feature activation+1.057
S
TokenS
Feature activation+2.399
.
Token.
Feature activation+1.953
from
Token from
Feature activation+1.137
2007
Token 2007
Feature activation+0.582
to
Token to
Feature activation+0.143
2015
Token 2015
Feature activation+0.806
when
Token when
Feature activation+0.114
it
Token it
Feature activation+0.000

INTERVAL 0.426 - 0.851
CONTAINS 0.025%

in
Token in
Feature activation+3.798
public
Token public
Feature activation+3.211
finances
Token finances
Feature activation+2.160
.
Token.
Feature activation+0.427
The
Token The
Feature activation+0.788
new
Token new
Feature activation+0.593
policy
Token policy
Feature activation+0.000
announced
Token announced
Feature activation+0.000
in
Token in
Feature activation+0.443
April
Token April
Feature activation+0.000
by
Token by
Feature activation+0.000
a
Token a
Feature activation+2.819
journey
Token journey
Feature activation+1.110
lasting
Token lasting
Feature activation+0.395
200
Token 200
Feature activation+0.599
to
Token to
Feature activation+0.046
250
Token 250
Feature activation+0.517
days
Token days
Feature activation+0.138
.
Token.
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
The
TokenThe
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
in
Token in
Feature activation+3.798
question
Token question
Feature activation+2.692
is
Token is
Feature activation+1.890
currently
Token currently
Feature activation+1.162
charged
Token charged
Feature activation+0.769
with
Token with
Feature activation+0.000
managing
Token managing
Feature activation+0.000
the
Token the
Feature activation+0.000
business
Token business
Feature activation+0.000
empire
Token empire
Feature activation+0.000
at
Token at
Feature activation+0.000
the
Token the
Feature activation+0.000
Monarch
Token Monarch
Feature activation+0.000
Theatre
Token Theatre
Feature activation+0.000
in
Token in
Feature activation+0.498
downtown
Token downtown
Feature activation+0.576
Phoenix
Token Phoenix
Feature activation+0.000
on
Token on
Feature activation+0.000
Saturday
Token Saturday
Feature activation+0.000
,
Token,
Feature activation+0.000
April
Token April
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
x
Tokenx
Feature activation+0.000
spe
Tokenspe
Feature activation+0.000
ier
Tokenier
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
Earlier
TokenEarlier
Feature activation+0.494
this
Token this
Feature activation+0.000
week
Token week
Feature activation+0.000
,
Token,
Feature activation+0.000
Bashar
Token Bashar
Feature activation+0.000
al
Token al
Feature activation+0.000

INTERVAL 0.000 - 0.426
CONTAINS 99.930%

that
Token that
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
why
Token why
Feature activation+0.000
he
Token he
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ļ
TokenĻ
Feature activation+0.000
s
Tokens
Feature activation+0.000
âĢ
Token âĢ
Feature activation+0.000
ľ
Tokenľ
Feature activation+0.000
to
Token to
Feature activation+0.000
work
Token work
Feature activation+0.000
to
Token to
Feature activation+0.000
reverse
Token reverse
Feature activation+0.000
-
Token-
Feature activation+0.000
engine
Tokenengine
Feature activation+0.000
er
Tokener
Feature activation+0.000
it
Token it
Feature activation+0.000
and
Token and
Feature activation+0.000
throw
Token throw
Feature activation+0.000
it
Token it
Feature activation+0.000
David
Token David
Feature activation+0.000
Mur
Token Mur
Feature activation+0.000
fee
Tokenfee
Feature activation+0.000
Faul
Token Faul
Feature activation+0.000
k
Tokenk
Feature activation+0.000
,
Token,
Feature activation+0.000
two
Token two
Feature activation+0.000
former
Token former
Feature activation+0.000
military
Token military
Feature activation+0.000
intercept
Token intercept
Feature activation+0.000
operators
Token operators
Feature activation+0.000
the
Token the
Feature activation+0.000
Sunday
Token Sunday
Feature activation+0.000
Times
Token Times
Feature activation+0.000
,
Token,
Feature activation+0.000
said
Token said
Feature activation+0.000
:
Token:
Feature activation+0.000
"
Token "
Feature activation+0.000
In
TokenIn
Feature activation+0.000
order
Token order
Feature activation+0.000
to
Token to
Feature activation+0.000
be
Token be
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
But
TokenBut
Feature activation+0.000
she
Token she
Feature activation+0.000
stopped
Token stopped
Feature activation+0.000
short
Token short
Feature activation+0.000
of
Token of
Feature activation+0.000
calling
Token calling
Feature activation+0.000
for
Token for
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 28: Dead

TOP ACTIVATIONS
MAX = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 2.426

,
Token,
Feature activation-0.083
is
Token is
Feature activation-0.099
that
Token that
Feature activation-0.257
whatever
Token whatever
Feature activation-0.392
policy
Token policy
Feature activation+0.020
prescriptions
Token prescriptions
Feature activation+0.348
that
Token that
Feature activation-0.142
we
Token we
Feature activation-0.029
've
Token've
Feature activation+0.173
been
Token been
Feature activation-0.171
proposing
Token proposing
Feature activation-0.082
<|endoftext|>
Token<|endoftext|>
Feature activation-6.715
perspective
Token perspective
Feature activation-0.106
.
Token.
Feature activation+0.051
"
Token "
Feature activation+0.357
What
TokenWhat
Feature activation-0.445
is
Token is
Feature activation-0.197
true
Token true
Feature activation-0.650
,
Token,
Feature activation+0.064
though
Token though
Feature activation-0.259
prescriptions
Token prescriptions
Feature activation+0.237
that
Token that
Feature activation+0.015
we
Token we
Feature activation+0.158
've
Token've
Feature activation+0.187
been
Token been
Feature activation+0.001
proposing
Token proposing
Feature activation+0.576
don
Token don
Feature activation-0.101
't
Token't
Feature activation-0.396
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
true
Token true
Feature activation-0.926
,
Token,
Feature activation-0.083
though
Token though
Feature activation-0.353
,
Token,
Feature activation-0.071
is
Token is
Feature activation-0.056
that
Token that
Feature activation+0.660
whatever
Token whatever
Feature activation-0.249
policy
Token policy
Feature activation-0.409
prescriptions
Token prescriptions
Feature activation-0.071
that
Token that
Feature activation+0.053
we
Token we
Feature activation+0.010
<|endoftext|>
Token<|endoftext|>
Feature activation-8.913
perspective
Token perspective
Feature activation-0.348
.
Token.
Feature activation-0.224
"
Token "
Feature activation+0.736
What
TokenWhat
Feature activation+0.197
is
Token is
Feature activation+0.132
true
Token true
Feature activation-0.697
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-9.125
perspective
Token perspective
Feature activation-0.286
.
Token.
Feature activation+0.043
"
Token "
Feature activation+1.965
What
TokenWhat
Feature activation+0.211
is
Token is
Feature activation-0.280
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.476
perspective
Token perspective
Feature activation-0.614
.
Token.
Feature activation+0.184
"
Token "
Feature activation+2.214
What
TokenWhat
Feature activation-0.512
is
Token is
Feature activation-0.395
true
Token true
Feature activation-1.621
,
Token,
Feature activation-0.016
though
Token though
Feature activation+0.000
perspective
Token perspective
Feature activation-0.336
.
Token.
Feature activation+0.029
"
Token "
Feature activation-0.766
What
TokenWhat
Feature activation-0.084
is
Token is
Feature activation-0.235
true
Token true
Feature activation+0.164
,
Token,
Feature activation+0.067
though
Token though
Feature activation-1.452
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
,
Token,
Feature activation-0.065
is
Token is
Feature activation-0.208
that
Token that
Feature activation-0.324
whatever
Token whatever
Feature activation-0.088
policy
Token policy
Feature activation-0.095
prescriptions
Token prescriptions
Feature activation+2.152
that
Token that
Feature activation-0.325
we
Token we
Feature activation-0.045
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-8.027
perspective
Token perspective
Feature activation-0.090
.
Token.
Feature activation-0.008
"
Token "
Feature activation+0.401
What
TokenWhat
Feature activation+0.132
is
Token is
Feature activation-0.099
true
Token true
Feature activation-0.461
,
Token,
Feature activation-0.004
though
Token though
Feature activation-0.319
<|endoftext|>
Token<|endoftext|>
Feature activation-8.563
perspective
Token perspective
Feature activation-0.136
.
Token.
Feature activation-0.136
"
Token "
Feature activation+0.156
What
TokenWhat
Feature activation+0.439
is
Token is
Feature activation-0.352
true
Token true
Feature activation-0.546
,
Token,
Feature activation+0.008
though
Token though
Feature activation-0.303
,
Token,
Feature activation-0.133
true
Token true
Feature activation-0.764
,
Token,
Feature activation-0.098
though
Token though
Feature activation-0.420
,
Token,
Feature activation-0.221
is
Token is
Feature activation-0.189
that
Token that
Feature activation+0.372
whatever
Token whatever
Feature activation-0.120
policy
Token policy
Feature activation+0.120
prescriptions
Token prescriptions
Feature activation+0.360
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
,
Token,
Feature activation-0.081
is
Token is
Feature activation-0.075
that
Token that
Feature activation-0.304
whatever
Token whatever
Feature activation-0.049
policy
Token policy
Feature activation-0.015
prescriptions
Token prescriptions
Feature activation+2.223
that
Token that
Feature activation-0.151
we
Token we
Feature activation+0.177
've
Token've
Feature activation+0.098
been
Token been
Feature activation-0.664
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.104
is
Token is
Feature activation-0.121
that
Token that
Feature activation-0.245
whatever
Token whatever
Feature activation-0.042
policy
Token policy
Feature activation+0.087
prescriptions
Token prescriptions
Feature activation+2.426
that
Token that
Feature activation-0.201
we
Token we
Feature activation+0.137
've
Token've
Feature activation-0.424
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.112
is
Token is
Feature activation-0.271
that
Token that
Feature activation-0.538
whatever
Token whatever
Feature activation-0.257
policy
Token policy
Feature activation+0.340
prescriptions
Token prescriptions
Feature activation+1.014
that
Token that
Feature activation-0.157
we
Token we
Feature activation-0.100
've
Token've
Feature activation+0.163
been
Token been
Feature activation-0.052
proposing
Token proposing
Feature activation+0.861
that
Token that
Feature activation-0.073
we
Token we
Feature activation-0.014
've
Token've
Feature activation+0.097
been
Token been
Feature activation-0.035
proposing
Token proposing
Feature activation-0.041
don
Token don
Feature activation+0.844
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.957
perspective
Token perspective
Feature activation-0.048
.
Token.
Feature activation+0.096
"
Token "
Feature activation+0.576
What
TokenWhat
Feature activation-0.050
is
Token is
Feature activation-0.267
true
Token true
Feature activation-0.936
,
Token,
Feature activation-0.047
though
Token though
Feature activation-0.295
<|endoftext|>
Token<|endoftext|>
Feature activation-7.953
perspective
Token perspective
Feature activation-0.178
.
Token.
Feature activation+0.009
"
Token "
Feature activation+1.060
What
TokenWhat
Feature activation-0.237
is
Token is
Feature activation-0.169
true
Token true
Feature activation-0.811
,
Token,
Feature activation-0.129
though
Token though
Feature activation-0.427
<|endoftext|>
Token<|endoftext|>
Feature activation-7.702
perspective
Token perspective
Feature activation-0.578
.
Token.
Feature activation-0.122
"
Token "
Feature activation+0.350
What
TokenWhat
Feature activation-0.264
is
Token is
Feature activation-0.119
true
Token true
Feature activation+0.228
,
Token,
Feature activation-0.000
though
Token though
Feature activation-1.208
<|endoftext|>
Token<|endoftext|>
Feature activation-7.791
perspective
Token perspective
Feature activation-0.194
.
Token.
Feature activation+0.028
"
Token "
Feature activation+0.648
What
TokenWhat
Feature activation-0.017
is
Token is
Feature activation-0.051
true
Token true
Feature activation-1.526
,
Token,
Feature activation+0.061
though
Token though
Feature activation-0.733

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.09

Head 2: 0.08

Head 3: 0.08

Head 4: 0.09

Head 5: 0.08

Head 6: 0.10

Head 7: 0.08

Head 8: 0.08

Head 9: 0.08

Head 10: 0.09

Head 11: 0.08

Positive logits

etsk: 3.76

ococ: 3.04

Pont: 2.95

levard: 2.74

Yel: 2.73

ça: 2.66

delim: 2.65

Sao: 2.65

Rouge: 2.61

Brazil: 2.60

Ll: 2.56

lineback: 2.53

\\\\\\\\: 2.51

Riv: 2.49

court: 2.45

Outer: 2.43

Rouse: 2.42

Pont: 2.41

Portug: 2.41

Ax: 2.41

Negative logits

ment: -2.79

kettle: -2.74

apon: -2.65

hay: -2.61

(token not shown): -2.58

sunscreen: -2.53

recruitment: -2.51

fortun: -2.49

mentation: -2.48

HAR: -2.46

resil: -2.45

ITCH: -2.45

conditioning: -2.42

cumulative: -2.41

hair: -2.40

(token not shown): -2.36

shampoo: -2.35

Genie: -2.35

femin: -2.34

و: -2.34

INTERVAL 0.000 - 0.000
CONTAINS 100.000%

robbery
Token robbery
Feature activation+0.000
gone
Token gone
Feature activation+0.000
bad
Token bad
Feature activation+0.000
or
Token or
Feature activation+0.000
whether
Token whether
Feature activation+0.000
they
Token they
Feature activation+0.000
were
Token were
Feature activation+0.000
paid
Token paid
Feature activation+0.000
assassins
Token assassins
Feature activation+0.000
.
Token .
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
new
Token new
Feature activation+0.000
model
Token model
Feature activation+0.000
looks
Token looks
Feature activation+0.000
nearly
Token nearly
Feature activation+0.000
identical
Token identical
Feature activation+0.000
to
Token to
Feature activation+0.000
its
Token its
Feature activation+0.000
predecessor
Token predecessor
Feature activation+0.000
,
Token,
Feature activation+0.000
but
Token but
Feature activation+0.000
instead
Token instead
Feature activation+0.000
and
Token and
Feature activation+0.000
satellite
Token satellite
Feature activation+0.000
imagery
Token imagery
Feature activation+0.000
via
Token via
Feature activation+0.000
Wik
Token Wik
Feature activation+0.000
im
Tokenim
Feature activation+0.000
ap
Tokenap
Feature activation+0.000
ia
Tokenia
Feature activation+0.000
,
Token,
Feature activation+0.000
Google
Token Google
Feature activation+0.000
Earth
Token Earth
Feature activation+0.000
greater
Token greater
Feature activation+0.000
triumph
Token triumph
Feature activation+0.000
.
Token.
Feature activation+0.000
Your
Token Your
Feature activation+0.000
absence
Token absence
Feature activation+0.000
from
Token from
Feature activation+0.000
the
Token the
Feature activation+0.000
field
Token field
Feature activation+0.000
made
Token made
Feature activation+0.000
you
Token you
Feature activation+0.000
even
Token even
Feature activation+0.000
âĢ
TokenâĢ
Feature activation+0.000
Ŀ
TokenĿ
Feature activation+0.000
was
Token was
Feature activation+0.000
incredibly
Token incredibly
Feature activation+0.000
trans
Token trans
Feature activation+0.000
ph
Tokenph
Feature activation+0.000
obic
Tokenobic
Feature activation+0.000
,
Token,
Feature activation+0.000
Three
Token Three
Feature activation+0.000
Ireland
Token Ireland
Feature activation+0.000
tweeted
Token tweeted
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Feature 29: Ultra low frequency cluster

TOP ACTIVATIONS
MAX = 0.012

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.012
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000

Top DFA by src position
MAX = 14.830

the
Token the
Feature activation-0.024
team
Token team
Feature activation-0.007
has
Token has
Feature activation+0.005
signed
Token signed
Feature activation+0.006
forward
Token forward
Feature activation+0.025
Tyson
Token Tyson
Feature activation+14.830
J
Token J
Feature activation-0.160
ost
Tokenost
Feature activation-0.217
(
Token (
Feature activation-0.028
J
TokenJ
Feature activation-0.034
OH
TokenOH
Feature activation-0.062
<|endoftext|>
Token<|endoftext|>
Feature activation-5.341
perspective
Token perspective
Feature activation-0.063
.
Token.
Feature activation+0.024
"
Token "
Feature activation+0.115
What
TokenWhat
Feature activation-0.422
is
Token is
Feature activation-0.180
true
Token true
Feature activation-0.499
,
Token,
Feature activation-0.019
though
Token though
Feature activation-0.209
whatever
Token whatever
Feature activation-0.206
policy
Token policy
Feature activation-0.052
prescriptions
Token prescriptions
Feature activation-0.218
that
Token that
Feature activation-0.062
we
Token we
Feature activation-0.006
've
Token've
Feature activation+0.077
been
Token been
Feature activation-0.028
proposing
Token proposing
Feature activation-0.033
don
Token don
Feature activation-4.454
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation-0.084
is
Token is
Feature activation-0.058
that
Token that
Feature activation+0.139
whatever
Token whatever
Feature activation-0.336
policy
Token policy
Feature activation-0.237
prescriptions
Token prescriptions
Feature activation+0.143
that
Token that
Feature activation-0.021
we
Token we
Feature activation-0.004
've
Token've
Feature activation+0.058
been
Token been
Feature activation-0.057
proposing
Token proposing
Feature activation-0.173
<|endoftext|>
Token<|endoftext|>
Feature activation-7.333
perspective
Token perspective
Feature activation-0.202
.
Token.
Feature activation-0.129
"
Token "
Feature activation+0.106
What
TokenWhat
Feature activation-0.324
is
Token is
Feature activation+0.320
true
Token true
Feature activation-0.740
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-7.573
perspective
Token perspective
Feature activation-0.168
.
Token.
Feature activation-0.032
"
Token "
Feature activation+0.537
What
TokenWhat
Feature activation-0.287
is
Token is
Feature activation+0.056
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-5.972
perspective
Token perspective
Feature activation-0.397
.
Token.
Feature activation+0.083
"
Token "
Feature activation+0.862
What
TokenWhat
Feature activation-0.766
is
Token is
Feature activation-0.329
true
Token true
Feature activation-1.523
,
Token,
Feature activation-0.092
though
Token though
Feature activation+0.000
,
Token,
Feature activation-0.194
is
Token is
Feature activation-0.116
that
Token that
Feature activation-0.274
whatever
Token whatever
Feature activation-0.239
policy
Token policy
Feature activation-0.033
prescriptions
Token prescriptions
Feature activation+0.152
that
Token that
Feature activation-0.084
we
Token we
Feature activation-0.020
've
Token've
Feature activation+0.015
been
Token been
Feature activation-0.055
proposing
Token proposing
Feature activation+0.070
,
Token,
Feature activation-0.091
is
Token is
Feature activation-0.067
that
Token that
Feature activation-0.339
whatever
Token whatever
Feature activation-0.128
policy
Token policy
Feature activation+0.320
prescriptions
Token prescriptions
Feature activation+2.586
that
Token that
Feature activation-0.222
we
Token we
Feature activation+0.076
've
Token've
Feature activation-0.337
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.173
is
Token is
Feature activation-0.132
that
Token that
Feature activation-0.044
whatever
Token whatever
Feature activation-0.393
policy
Token policy
Feature activation+0.208
prescriptions
Token prescriptions
Feature activation+0.500
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.500
perspective
Token perspective
Feature activation-0.019
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.112
What
TokenWhat
Feature activation-0.231
is
Token is
Feature activation-0.164
true
Token true
Feature activation-0.611
,
Token,
Feature activation-0.000
though
Token though
Feature activation-0.231
.
Token.
Feature activation-0.036
"
Token "
Feature activation-0.014
What
TokenWhat
Feature activation-0.094
is
Token is
Feature activation-0.230
true
Token true
Feature activation-0.259
,
Token,
Feature activation+0.020
though
Token though
Feature activation-0.238
,
Token,
Feature activation-0.094
is
Token is
Feature activation-0.106
that
Token that
Feature activation-0.225
whatever
Token whatever
Feature activation-0.131
,
Token,
Feature activation-0.092
is
Token is
Feature activation-0.029
that
Token that
Feature activation-0.248
whatever
Token whatever
Feature activation-0.116
policy
Token policy
Feature activation+0.237
prescriptions
Token prescriptions
Feature activation+2.303
that
Token that
Feature activation-0.177
we
Token we
Feature activation+0.067
've
Token've
Feature activation+0.002
been
Token been
Feature activation-0.483
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.139
is
Token is
Feature activation-0.092
that
Token that
Feature activation-0.360
whatever
Token whatever
Feature activation-0.139
policy
Token policy
Feature activation+0.125
prescriptions
Token prescriptions
Feature activation+2.641
that
Token that
Feature activation-0.257
we
Token we
Feature activation-0.082
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
,
Token,
Feature activation-0.167
is
Token is
Feature activation-0.206
that
Token that
Feature activation-0.612
whatever
Token whatever
Feature activation-0.313
policy
Token policy
Feature activation+0.485
prescriptions
Token prescriptions
Feature activation+0.856
that
Token that
Feature activation-0.140
we
Token we
Feature activation-0.043
've
Token've
Feature activation-0.012
been
Token been
Feature activation-0.094
proposing
Token proposing
Feature activation+0.102
.
Token.
Feature activation-0.039
"
Token "
Feature activation+0.003
What
TokenWhat
Feature activation-0.590
is
Token is
Feature activation-0.135
true
Token true
Feature activation-0.601
,
Token,
Feature activation+0.003
though
Token though
Feature activation-0.951
,
Token,
Feature activation-0.689
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
,
Token,
Feature activation-0.114
is
Token is
Feature activation-0.083
that
Token that
Feature activation-0.153
whatever
Token whatever
Feature activation-0.239
policy
Token policy
Feature activation+0.028
prescriptions
Token prescriptions
Feature activation+0.399
that
Token that
Feature activation-0.196
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation-6.508
perspective
Token perspective
Feature activation-0.108
.
Token.
Feature activation-0.013
"
Token "
Feature activation+0.321
What
TokenWhat
Feature activation-0.276
is
Token is
Feature activation-0.023
true
Token true
Feature activation-0.530
,
Token,
Feature activation+0.010
though
Token though
Feature activation-0.330
.
Token.
Feature activation-0.114
"
Token "
Feature activation-2.485
What
TokenWhat
Feature activation-0.462
is
Token is
Feature activation-0.141
true
Token true
Feature activation-0.656
,
Token,
Feature activation+0.008
though
Token though
Feature activation-1.257
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
.
Token.
Feature activation-0.106
"
Token "
Feature activation-0.243
What
TokenWhat
Feature activation-0.360
is
Token is
Feature activation+0.003
true
Token true
Feature activation-1.393
,
Token,
Feature activation+0.045
though
Token though
Feature activation-0.571
,
Token,
Feature activation-0.302
is
Token is
Feature activation-0.313
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000

Decoder Weights Distribution

Head 0: 0.08

Head 1: 0.09

Head 2: 0.07

Head 3: 0.08

Head 4: 0.09

Head 5: 0.08

Head 6: 0.07

Head 7: 0.08

Head 8: 0.09

Head 9: 0.09

Head 10: 0.09

Head 11: 0.08

Positive logits

baugh 3.09

Yen 3.03

bil 3.00

Gel 2.96

ngth 2.96

HAEL 2.83

Rye 2.80

Fein 2.74

ende 2.66

holm 2.59

Hills 2.53

Sul 2.50

tan 2.47

Brill 2.46

skin 2.45

leeve 2.44

bin 2.43

Shooter 2.41

iasis 2.41

ayne 2.41

Negative logits

paralle -3.13

soDeliveryDate -2.82

lov -2.74

Rouse -2.59

equals -2.57

Georgia -2.57

overtake -2.53

Georg -2.47

friendly -2.42

guiActive -2.37

mistress -2.37

regist -2.37

beat -2.36

Nato -2.33

force -2.33

maid -2.30

Ultron -2.28

DCS -2.28

ORS -2.28

Dom -2.27

INTERVAL 0.011 - 0.012
CONTAINS 0.000%

Joe
Token Joe
Feature activation+0.000
Sak
Token Sak
Feature activation+0.000
ic
Tokenic
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
T
TokenT
Feature activation+0.012
yson
Tokenyson
Feature activation+0.000
is
Token is
Feature activation+0.000
an
Token an
Feature activation+0.000
all
Token all
Feature activation+0.000
-
Token-
Feature activation+0.000

INTERVAL 0.009 - 0.011
CONTAINS 0.000%

INTERVAL 0.008 - 0.009
CONTAINS 0.000%

INTERVAL 0.007 - 0.008
CONTAINS 0.000%

INTERVAL 0.006 - 0.007
CONTAINS 0.000%

INTERVAL 0.005 - 0.006
CONTAINS 0.000%

INTERVAL 0.004 - 0.005
CONTAINS 0.000%

INTERVAL 0.002 - 0.004
CONTAINS 0.000%

INTERVAL 0.001 - 0.002
CONTAINS 0.000%

INTERVAL 0.000 - 0.001
CONTAINS 100.000%

pper
Tokenpper
Feature activation+0.000
Cr
Token Cr
Feature activation+0.000
ust
Tokenust
Feature activation+0.000
,
Token,
Feature activation+0.000
New
Token New
Feature activation+0.000
Mil
Token Mil
Feature activation+0.000
ford
Tokenford
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Ċ
TokenĊ
Feature activation+0.000
Wal
TokenWal
Feature activation+0.000
rus
Tokenrus
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
$
Token $
Feature activation+0.000
25
Token25
Feature activation+0.000
per
Token per
Feature activation+0.000
car
Token car
Feature activation+0.000
wash
Token wash
Feature activation+0.000
to
Token to
Feature activation+0.000
make
Token make
Feature activation+0.000
a
Token a
Feature activation+0.000
profit
Token profit
Feature activation+0.000
âĢĶ
Token âĢĶ
Feature activation+0.000
her
Token her
Feature activation+0.000
character
Token character
Feature activation+0.000
next
Token next
Feature activation+0.000
season
Token season
Feature activation+0.000
.
Token.
Feature activation+0.000
Joe
Token Joe
Feature activation+0.000
Henderson
Token Henderson
Feature activation+0.000
confirmed
Token confirmed
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
would
Token would
Feature activation+0.000
wa
Tokenwa
Feature activation+0.000
an
Tokenan
Feature activation+0.000
Rand
Token Rand
Feature activation+0.000
le
Tokenle
Feature activation+0.000
-
Token-
Feature activation+0.000
El
TokenEl
Feature activation+0.000
and
Token and
Feature activation+0.000
has
Token has
Feature activation+0.000
a
Token a
Feature activation+0.000
proven
Token proven
Feature activation+0.000
track
Token track
Feature activation+0.000
truck
Token truck
Feature activation+0.000
of
Token of
Feature activation+0.000
masked
Token masked
Feature activation+0.000
burgl
Token burgl
Feature activation+0.000
ars
Tokenars
Feature activation+0.000
as
Token as
Feature activation+0.000
they
Token they
Feature activation+0.000
fled
Token fled
Feature activation+0.000
a
Token a
Feature activation+0.000
robbery
Token robbery
Feature activation+0.000
led
Token led
Feature activation+0.000

BOTTOM ACTIVATIONS
MIN = 0.000

proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
the
Token the
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
,
Token,
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
heard
Token heard
Feature activation+0.000
by
Token by
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
<|endoftext|>
Token<|endoftext|>
Feature activation+0.000
perspective
Token perspective
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
.
Token.
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
"
Token "
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
been
Token been
Feature activation+0.000
proposing
Token proposing
Feature activation+0.000
don
Token don
Feature activation+0.000
't
Token't
Feature activation+0.000
reach
Token reach
Feature activation+0.000
,
Token,
Feature activation+0.000
are
Token are
Feature activation+0.000
not
Token not
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
've
Token've
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000
we
Token we
Feature activation+0.000
What
TokenWhat
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
is
Token is
Feature activation+0.000
true
Token true
Feature activation+0.000
,
Token,
Feature activation+0.000
though
Token though
Feature activation+0.000
,
Token,
Feature activation+0.000
is
Token is
Feature activation+0.000
that
Token that
Feature activation+0.000
whatever
Token whatever
Feature activation+0.000
policy
Token policy
Feature activation+0.000
prescriptions
Token prescriptions
Feature activation+0.000
that
Token that
Feature activation+0.000