Forum: >>> Magnum BBS <<<

a common control group, how to perform statistical analysis.

From Jinsong Zhao@21:1/5 to All on Sat Dec 10 12:19:51 2022

Hi there,

In an incubation experiment, we want to test the extra added plastic on
soil properties. We have two factors. One is the age of plastic (a),
which has 3 levels, i.e., a0, a10, and a30; and the other one is the
applied rate (r), i.e., r0, r2, and r20. We plan to use a randomized
complete design and have 9 treatments with 3 replications for each
treatment.

The fact is r0a0, r0a10, and r0a30 are the same. They are treatments
with no added plastic. So we want to reduce those three treatments to
one. Therefore, we have 7 other than 9 treatments.

Now, we have problems performing statistical analysis. Can we use
two-way ANOVA to check the main and interaction effects of the two
factors? If yes, what's we have to pay attention to?

We really appreciate any suggestion or hint. Thanks in advance.

Best,
Jinsong

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From David Jones@21:1/5 to Jinsong Zhao on Sat Dec 10 09:44:18 2022

Jinsong Zhao wrote:

Hi there,

In an incubation experiment, we want to test the extra added plastic
on soil properties. We have two factors. One is the age of plastic
(a), which has 3 levels, i.e., a0, a10, and a30; and the other one is
the applied rate (r), i.e., r0, r2, and r20. We plan to use a
randomized complete design and have 9 treatments with 3 replications
for each treatment.

The fact is r0a0, r0a10, and r0a30 are the same. They are treatments
with no added plastic. So we want to reduce those three treatments to
one. Therefore, we have 7 other than 9 treatments.

Now, we have problems performing statistical analysis. Can we use
two-way ANOVA to check the main and interaction effects of the two
factors? If yes, what's we have to pay attention to?

We really appreciate any suggestion or hint. Thanks in advance.

Best,
Jinsong

As you presumably know, ANOVA is just a special case of a linear
regression type of analysis. Your situation falls slightly outside of
that special case. My suggestion is that you look at the theory of
where ANOVA fits into its regression-type of background, in terms of regression-model structure, see what changes you need to make to that model-structure, and then proceed with fitting (and if necessary,
formal testing) of that regression model.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Rich Ulrich@21:1/5 to dajhawkxx@nowherel.com on Sat Dec 10 15:40:37 2022

On Sat, 10 Dec 2022 09:44:18 -0000 (UTC), "David Jones" <dajhawkxx@nowherel.com> wrote:

Jinsong Zhao wrote:

Hi there,

In an incubation experiment, we want to test the extra added plastic
on soil properties. We have two factors. One is the age of plastic
(a), which has 3 levels, i.e., a0, a10, and a30; and the other one is
the applied rate (r), i.e., r0, r2, and r20. We plan to use a
randomized complete design and have 9 treatments with 3 replications
for each treatment.

The fact is r0a0, r0a10, and r0a30 are the same. They are treatments
with no added plastic. So we want to reduce those three treatments to
one. Therefore, we have 7 other than 9 treatments.

Now, we have problems performing statistical analysis. Can we use
two-way ANOVA to check the main and interaction effects of the two
factors? If yes, what's we have to pay attention to?

We really appreciate any suggestion or hint. Thanks in advance.

Best,
Jinsong

As you presumably know, ANOVA is just a special case of a linear
regression type of analysis. Your situation falls slightly outside of
that special case. My suggestion is that you look at the theory of
where ANOVA fits into its regression-type of background, in terms of >regression-model structure, see what changes you need to make to that >model-structure, and then proceed with fitting (and if necessary,
formal testing) of that regression model.

Yes. Using regression is what I would try, for all the groups, coding
contrasts as needed.

Here are a couple of other thoughts.

If the factors are strong, you may get most of your useful
information from an ANOVA omitting r0 entirely, at the start. Plan
to figure out what that "baseline" means, after you plot out the rest.
If the baseline is important, perhaps that single group should
be larger than the others. Duncan's procedure for a single control
versus multiple groups recommends a larger N based on the number
of other groups -- I don't know if yours should be thought of as "6"
(with replications=3) or as "2", the number of groups in the other
contrasts (with replications = 6). I don't remember adapting Duncan's
to a two-way design before.

Age and Rate are both quantitative, which implies that a single d.f.
contrast for "linear" would be the powerful test. However, your
scaling for the regression contrasts is not linear in an obvious way,
either for (0, 2, 20) or for (0,10,30). - Often, the arbitrary-seeming
numbers have been chosen because the PI /expects/ those to be
equal-interval steps, in which case the simple (-1, 0, 1) works.

Your contrasts when including interactions will have correlations,
so the regression results that you look at for main effects should
/not/ include the interaction effects.

--
Rich Ulrich

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Jinsong Zhao@21:1/5 to Rich Ulrich on Sun Dec 11 21:48:27 2022

On 2022/12/11 4:40, Rich Ulrich wrote:

On Sat, 10 Dec 2022 09:44:18 -0000 (UTC), "David Jones" <dajhawkxx@nowherel.com> wrote:

Jinsong Zhao wrote:

Hi there,

In an incubation experiment, we want to test the extra added plastic
on soil properties. We have two factors. One is the age of plastic
(a), which has 3 levels, i.e., a0, a10, and a30; and the other one is
the applied rate (r), i.e., r0, r2, and r20. We plan to use a
randomized complete design and have 9 treatments with 3 replications
for each treatment.

The fact is r0a0, r0a10, and r0a30 are the same. They are treatments
with no added plastic. So we want to reduce those three treatments to
one. Therefore, we have 7 other than 9 treatments.

Now, we have problems performing statistical analysis. Can we use
two-way ANOVA to check the main and interaction effects of the two
factors? If yes, what's we have to pay attention to?

We really appreciate any suggestion or hint. Thanks in advance.

Best,
Jinsong

As you presumably know, ANOVA is just a special case of a linear
regression type of analysis. Your situation falls slightly outside of
that special case. My suggestion is that you look at the theory of
where ANOVA fits into its regression-type of background, in terms of
regression-model structure, see what changes you need to make to that
model-structure, and then proceed with fitting (and if necessary,
formal testing) of that regression model.

Yes. Using regression is what I would try, for all the groups, coding contrasts as needed.

Here are a couple of other thoughts.

If the factors are strong, you may get most of your useful
information from an ANOVA omitting r0 entirely, at the start. Plan
to figure out what that "baseline" means, after you plot out the rest.
If the baseline is important, perhaps that single group should
be larger than the others. Duncan's procedure for a single control
versus multiple groups recommends a larger N based on the number
of other groups -- I don't know if yours should be thought of as "6"
(with replications=3) or as "2", the number of groups in the other
contrasts (with replications = 6). I don't remember adapting Duncan's
to a two-way design before.

Age and Rate are both quantitative, which implies that a single d.f.
contrast for "linear" would be the powerful test. However, your
scaling for the regression contrasts is not linear in an obvious way,
either for (0, 2, 20) or for (0,10,30). - Often, the arbitrary-seeming numbers have been chosen because the PI /expects/ those to be
equal-interval steps, in which case the simple (-1, 0, 1) works.

Your contrasts when including interactions will have correlations,
so the regression results that you look at for main effects should
/not/ include the interaction effects.

Thanks a lot for the reply and instructions. I do not understand well
about coding contrasts. My knowledge of statistics, which I only learned introductory statistics during my college, is very limited. If you could
point me to some textbooks about this kind of statistical analysis, I
would really appreciate it.

Best,
Jinsong

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Rich Ulrich@21:1/5 to All on Thu Dec 15 00:46:16 2022

On Sun, 11 Dec 2022 21:48:27 +0800, Jinsong Zhao <jszhao@yeah.net>
wrote:

(me)

< snip stuff >

Your contrasts when including interactions will have correlations,
so the regression results that you look at for main effects should
/not/ include the interaction effects.

Thanks a lot for the reply and instructions. I do not understand well
about coding contrasts. My knowledge of statistics, which I only learned >introductory statistics during my college, is very limited. If you could >point me to some textbooks about this kind of statistical analysis, I
would really appreciate it.

Okay, you have had one intro course in statistics. Did you take a
lot of math, including matrix theory? I did. Basic calculus
certainly helped me understand distributions and tests. In my
first job, as a computer programmer, I was mentored by a
statistician -- but I was able to help HIM with, for instance,
canonical correlation and factor analysis, which are 'merely'
particular manipulations of matrices. (And regression is one
simple case of canonical correlation.)

The intro-to-stats that I took in psychology was the most worthless
course of my career, for content. However, it partly prepared me for consulting with psychologists who know no math.

I don't have any idea how much you know. It would be
malpractice of a sort to point you at a text and say, Good Luck.

There are SO MANY ways to go wrong.

Maybe read up first, in some intro-to-regression book in your
area. After you have read enough to understand the language,
hire a consultant for /few/ hours of consulting. Let them design
and run the analyses, and interpret them.

Ask questions; float your ideas, and see how close you came;
ask THEM for what texts they recommend.

Having worked as a statistical consultant, I can say that the
time-consuming part of the task is often "cleaning up the data."
Missing values? Out of range values? - What do you want to do?
You should be able to clean the data, and quickly show them
you have done that step. Plots and graphs of bivariate
distributions are good for providing assurance that there are
no hidden problems.

- Professors at universities are apt to be willing to moonlight.
Someone who has published relevant research would be best.
A theorist who has never dealt with real data might be terrible.

(I'm recalling a fine theorist who made several fine posts here,
back during the Bush administration. Reef_fish Bob was
autistic-spectrum: bad at context, and hostile to people as
a reflex.)

--
Rich Ulrich

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	546
Nodes:	16 (2 / 14)
Uptime:	150:59:34
Calls:	10,383
Files:	14,054
Messages:	6,417,797

a common control group, how to perform statistical analysis.

Who's Online

System Info