Mid Continent Steel & Wire, Inc. v. United States

Slip Op. 23-45 UNITED STATES COURT OF INTERNATIONAL TRADE MID CONTINENT STEEL & WIRE, INC., Plaintiff and Consolidated Defendant-Intervenor, v. UNITED STATES, Before: Claire R. Kelly, Judge Defendant, Consol. Court No. 15-00213 and PT ENTERPRISE, INC. ET AL., Defendant-Intervenors and Consolidated Plaintiffs. OPINION AND ORDER [Remanding the U.S. Department of Commerce’s third remand redetermination in its antidumping duty investigation of certain steel nails from Taiwan.] Dated: April 3, 2023 Adam H. Gordon, Jennifer M. Smith, and Lauren Fraid, The Bristol Group PLLC of Washington, D.C., for plaintiff and consolidated defendant-intervenor Mid Continent Steel & Wire, Inc. Ned H. Marshak, Andrew T. Schutz, and Max F. Schutzman, Grunfeld Desiderio Lebowitz Silverman & Klestadt LLP of Washington, D.C., and New York, N.Y., for consolidated plaintiffs and defendant-intervenors PT Enterprise, Inc., Pro-Team Coil Nail Enterprise Inc., Unicatch Industrial Co., Ltd., WTA International Co., Ltd., Zon Mon Co., Ltd., Hor Liang Industrial Corp., President Industrial Inc., and Liang Chyuan Industrial Co., Ltd. Consol. Court No. 15-00213 Page 2 Mikki Cottet, Senior Trial Counsel, Commercial Litigation Branch, Civil Division, U.S. Department of Justice, of Washington, D.C., for defendant. Also on the brief were Brian M. Boynton, Principal Deputy Assistant Attorney General, and Patricia M. McCarthy, Director. Of counsel was Vania Wang, Attorney, Office of the Chief Counsel for Trade Enforcement & Compliance, U.S. Department of Commerce, of Washington, D.C. Kelly, Judge: Before the court is the U.S. Department of Commerce’s (“Commerce”) third remand redetermination in the antidumping duty investigation of certain steel nails from Taiwan, in accordance with the mandate of the Court of Appeals for the Federal Circuit in Mid Continent Steel & Wire, Inc. v. United States, 31 F.4th 1367 (Fed. Cir. 2022) rev’g in part 945 F. Supp. 3d 1298 (Ct. Int’l Tr. 2021). The Court of Appeals vacated and remanded for Commerce to reconsider or further explain its use of a simple average as the denominator of the Cohen’s d test, as part of Commerce’s differential pricing analysis. See Mandate, June 13, 2022, ECF No. 177; Remand Order, June 14, 2022, ECF No. 178. On remand, Commerce again asserts that its use of simple averaging is supported by statistical literature. See Final Results of Redetermination Purs. Ct. Remand, Nov. 10, 2022, ECF No. 186-1. For the following reasons, the court remands Commerce’s third remand redetermination for further explanation or reconsideration. BACKGROUND The court presumes familiarity with the facts of this case from this court’s previous opinions, as well as the Court of Appeals’ decision in Mid Continent V, and now recounts only the facts relevant to the court’s review of the Remand Results. On June 25, 2014, Commerce initiated an antidumping duty investigation of certain steel Consol. Court No. 15-00213 Page 3 nails from six countries, including Taiwan. See Certain Steel Nails from India, the Republic of Korea, Malaysia, the Sultanate of Oman, Taiwan, the Republic of Turkey, and the Socialist Republic of Vietnam, 79 Fed. Reg. 36,019 (Dep’t Commerce June 25, 2014) (initiation of less-than-fair-value investigations). On May 20, 2015, Commerce issued its final determination, which resulted in an antidumping duty order on subject nails from Taiwan. See Certain Steel Nails from Taiwan, 80 Fed. Reg. 28,959 (Dep’t Commerce May 20, 2015) (final determination of sales at less than fair value) (“Final Results”) and accompanying Issues and Decision Memorandum, May 13, 2015, ECF No. 17 (“Final Decision Memo.”). On March 23, 2017, this court sustained Commerce’s determination, including its decision to use a simple average in the denominator of Cohen’s d test. See Mid Continent Steel & Wire, Inc. v. United States, 219 F. Supp. 3d 1161 (Ct. Int’l Tr. 2017) (“Mid Continent I”). On October 3, 2019, the Court of Appeals vacated this court’s judgment and remanded in part to Commerce for further explanation of its decision to use the simple average in Cohen’s d test. See Mid Continent Steel & Wire, Inc. v. United States, 940 F.3d 662 (Fed. Cir. 2019) (“Mid Continent III”). On remand, Commerce defended its decision to use the simple average, explaining that its use of the simple average was both accurate and in accord with statistical literature. See Final Results of Redetermination Purs. Ct. Remand, June 16, 2020, ECF No. 144-1 (“Second Remand Results”). On January 8, 2021, this court again sustained, concluding that Commerce had adequately explained how its use of simple averaging Consol. Court No. 15-00213 Page 4 was more accurate, and thus a reasonable choice of methodology. See Mid Continent Steel & Wire, Inc. v. United States, 945 F. Supp. 3d 1298 (Ct. Int’l Tr. 2021) (“Mid Continent IV”). On April 21, 2022, the Court of Appeals again vacated this court’s judgment, remanding to Commerce for further explanation of its decision to use the simple average. See Mid Continent Steel & Wire, Inc. v. United States, 31 F.4th 1367 (Fed. Cir. 2019) (“Mid Continent V”). On remand, Commerce again defends its decision to use the simple average with the Cohen’s d test, explaining that its usage is consistent with statistical literature. See Final Results of Redetermination Purs. Ct. Remand, Nov. 10, 2022, ECF No. 186-1 (“Remand Results”). Consolidated Plaintiffs and Defendant- intervenors PT Enterprise, Inc., et al. (“PT”) submitted comments asserting that Commerce’s use of the simple average is not supported by literature and resulted in increased dumping margins, as well as challenging Commerce’s decision to exclude certain of its submissions from the record. See [PT’s] Cmts. on Remand Results, Dec. 13, 2022, ECF No. 188 (“PT’s Cmts.”). Plaintiff and Consolidated Defendant- intervenor Mid Continent Steel & Wire, Inc. (“Mid Continent”) submitted comments supporting Commerce’s use of simple averaging. See [Mid Continent’s] Cmts. Supp. Remand Results, Feb. 13, 2023, ECF No. 191 (“Mid Continent’s Cmts.”). On February 27, 2023, PT moved for oral argument, see [PT’s] Mot. Oral Arg., Feb. 27, 2023, ECF Consol. Court No. 15-00213 Page 5 No. 198, and on March 21, 2023, Mid Continent submitted its response in opposition to oral argument. See Resp. Opp. Oral Arg., March 21, 2023, ECF No. 200.1 JURISDICTION AND STANDARD OF REVIEW The court has jurisdiction pursuant to 28 U.S.C. § 1581(c) (2018), which grants the court authority to review actions initiated under 19 U.S.C. § 1516a(a)(2)(B)(i) 2 contesting the final determination in an antidumping duty order. The court will uphold Commerce’s determination unless it is “unsupported by substantial evidence on the record, or otherwise not in accordance with law.” 19 U.S.C. § 1516a(b)(1)(B)(i). “The results of a redetermination pursuant to court remand are also reviewed ‘for compliance with the court’s remand order.’” Xinjiamei Furniture Co. v. United States, 968 F. Supp. 2d 1255, 1259 (Ct. Int’l Tr. 2014). DISCUSSION PT’s Rejected Submissions As a preliminary matter, PT argues that Commerce improperly rejected portions of its case brief as new factual information. PT’s Cmts. at 31. Commerce rejected a report authored by a statistical consultant for PT, which was submitted together with PT’s comments on the draft remand results. See Rejection Ltr., ECF No. 195, A-583-854, PRRD 15, bar code 4304452-01 (Oct. 25, 2022) (“Rejection Ltr.”); 1 In light of the court’s decision on the merits, PT’s motion for oral argument will be denied as moot. 2 Further citations to the Tariff Act of 1930, as amended, are to the relevant provisions of Title 19 of the U.S. Code, 2018 edition. Consol. Court No. 15-00213 Page 6 see also W.A. Huber Decl. Concerning Draft Results of Redetermination Pursuant to Court Remand, ECF No. 195, A-583-854, PRRD 11, bar code 4290765-02 (Sept. 30, 2022) (“Huber Decl.”). Commerce properly rejected the Huber Declaration. Commerce’s regulations require that it reject untimely-filed factual information unless the record is reopened or, where appropriate, an extension is sought. 19 C.F.R. § 351.302(d). “Factual information” includes, among other things, data or statements of fact in support of allegations. 19 C.F.R. § 351.102(b)(21)(ii). Expert reports submitted to Commerce “analyzing reported information clearly assume[] the weight of evidence and, as such, amount[] to [d]ata or statements of fact in support of allegations, i.e., factual information.” PSC VSMPO-Avisma Corp. v. United States, 688 F.3d 751, 760 (Fed. Cir. 2012) (internal quotation and citation omitted). Commerce did not reopen the record for new submissions on remand, see Defendant’s Resp. Cmts. on Remand Redetermination, 29–30, Feb. 13, 2023, ECF No. 192 (“Def.’s Reply”), and the Huber Declaration contains both the analysis and conclusions of PT’s statistical expert as to the validity of the remand results. See generally Huber Decl. Therefore, the report constitutes new factual information, which was properly rejected by Commerce. PT also challenges Commerce’s rejection of certain pages it submitted from Cohen, and references in its comments to two non-record academic sources. PT’s Cmts. at 32–33; Rejection Ltr. at 3. PT argues that these sources are not factual information, but are only references to information in the public realm. PT’s Cmts. Consol. Court No. 15-00213 Page 7 at 32–33. Defendant counters that academic literature is factual information which must be submitted as part of the record for Commerce to consider. Def.’s Reply at 31. Again, factual information includes “data or statements of fact in support of allegations,” 19 C.F.R. § 351.102(b)(21)(ii), and PT’s academic sources contain statistical information which PT cites to support its argument in favor of weighted averaging. See PT’s Cmts. at 32. Therefore, as with the Huber declaration, Commerce did not reopen the record for new submissions on remand, so Commerce properly rejected PT’s untimely submissions. Cohen’s d Test and the Simple Average In Mid Continent V, the Court of Appeals remanded to Commerce to further explain or reconsider its use of the simple average in the Cohen’s d test, in light of statistical literature suggesting that only use of a weighted average is appropriate. Mid Continent V, 31 F.4th at 1381. Commerce makes two arguments to support its use of the simple average. First, it argues that the literature supports using the simple average when the “full population” of data is considered. Remand Results at 14. Second, it argues that sample sizes are irrelevant, because they only affect “statistical significance.” Id. at 14–16. 3 PT counters that the literature only supports 3 Commerce also argues against the alternative weighting proposals advanced by PT and the Court of Appeals. Remand Results at 16–23. Because Commerce has not elected to use either method on remand, the court does not reach the relative merits of using either a single standard deviation or PT’s proposed equation. Moreover, even if the alternate proposals were fully supported by the literature, this support would not necessarily detract from Commerce’s choice of the simple average. See Motor Vehicle Mfrs. Ass’n of U.S. v. State Farm Mut. Auto. Ins. Co., 463 U.S. 29, 43 (1983). Consol. Court No. 15-00213 Page 8 weighted averaging, and that for Commerce’s purposes, analysis of full populations is no different from analysis of a sample. PT’s Cmts. at 5–18. For the following reasons, the court remands Commerce’s determination. When investigating whether merchandise is being sold at less than fair value, Commerce typically compares the weighted average of normal values with the weighted average of export prices for comparable merchandise, unless it determines another method is appropriate. 19 U.S.C. § 1677f-1(d)(1)(A)(i); 19 C.F.R. § 351.414(c)(1). To address concerns over “targeted dumping,” 4 Congress amended 19 U.S.C. § 1677f-1 to allow Commerce to compare “the weighted average of the normal values to export prices . . . of individual transactions for comparable merchandise if (i) there is a pattern of export prices . . . for comparable merchandise that differ significantly among purchasers, regions or periods of time, and (ii) [Commerce] explains why such differences cannot be taken into account [with another method].” 19 U.S.C. § 1677f-1(d)(1)(B)(i)–(ii). Congress has not specified how Commerce should determine whether a “pattern of significantly different prices” exists. 5 Therefore, the court determines whether Commerce’s methodology is 4 Targeted dumping occurs when an exporter sells at a lower, “dumped” price to particular customers or regions, while selling at higher prices to other customers or regions, such that the higher-priced products mask the dumped products by increasing the overall average price. See Apex Frozen Foods Priv. Ltd. v. United States, 862 F.3d 1337, 1341 (Fed. Cir. 2017). 5 The Statement of Administrative Action of the Uruguay Round Agreements Act (footnote continued) Consol. Court No. 15-00213 Page 9 reasonable in light of considerations that run counter to its decision. See Motor Vehicle Mfrs. Ass’n of U.S. v. State Farm Mut. Auto. Ins. Co., 463 U.S. 29, 43 (1983); Ceramica Regiomontana, S.A. v. United States, 636 F. Supp. 961, 966 (Ct. Int’l Tr. 1986), aff’d, 810 F.2d 1137, 1139 (Fed. Cir. 1987). See also, e.g., Stupp Corporation v. United States, 5 F.4th 1341, 1354 (Fed. Cir. 2021) (standard for reviewing components of Commerce’s differential pricing methodology is reasonableness) (citing Mid Continent III, 940 F.3d at 667). To implement § 1677f-1(d)(1)(B) Commerce performs a “differential pricing analysis” of a respondent’s sales. See Differential Pricing Analysis; Request for Comments, 79 Fed. Reg. 26,720, 26,722 (Dep’t of Commerce May 9, 2014). This analysis contains three tests—the Cohen’s d test, the ratio test, and the meaningful difference test. Only the Cohen’s d test, which determines whether there is a “pattern of significantly different prices,” is at issue in this case. As applied by Commerce, the Cohen’s d test involves comparing the prices of “test groups” of a respondent’s sales to a “comparison group” by region, purchaser, and time period. See id. at 26,722. For each category, Commerce segregates sales into subsets, with one subset becoming the test group, and the remaining subsets being combined as the comparison group. Id. Commerce then calculates the means explains that Commerce should proceed “on a case-by-case basis, because small differences may be significant for one industry or one type of product, but not for another.” Uruguay Round Agreements Act, Statement of Administrative Action, H.R. Doc. No. 103-316, vol. 1, at 842–43 (1994), reprinted in 1994 U.S.C.C.A.N. 4040, 4178. Consol. Court No. 15-00213 Page 10 and standard deviations of the test and comparison groups. Id. Commerce finally calculates a d coefficient by dividing the difference in the groups’ means by the square root of the average of the squared standard deviations of each group. 6 See, e.g., Xanthan Gum From the People’s Republic of China: Final Determination of Sales at Less Than Fair Value, 78 Fed. Reg. 33,350 (Dep’t of Commerce June 4, 2013). Commerce finds the average of the squared standard deviations by adding them together and dividing by two, referring to the result as a “simple average.” See id. Commerce does not account for the differences in the size of each group, i.e., use a “weighted average.” Commerce tests each subset against the remaining subsets across each category, and assigns a d coefficient by dividing the difference in the groups’ means by the square root of the average of the squared standard deviations of each group. If the d value of a test group is equal to or greater than the “large threshold,” or 0.8, the observations within that group are said to have “passed” the Cohen’s d test. Differential Pricing Analysis; Request for Comments, 79 Fed. Reg. 26,720, 26,722 (Dep’t of Commerce May 9, 2014). If a sufficient quantity of sales by volume pass Cohen’s d test, Commerce may compare the export prices of individual transactions 6 Thus, d = |݉஺ െ ݉஻ | / ඥ(ɐଶ஺ + ɐଶ஻ )Τ2, where |݉஺ െ ݉஻ | is the absolute value of the difference in means between the test and comparison groups, and ɐ஺ଶ + ɐଶ஻ is the sum of the squared standard deviation of both groups. Standard deviation squared (ɐଶ ) is also referred to as “variance.” Commerce’s formulation of what it calls the Cohen’s d test is also known as Cohen’s equation (2.3.2). See Cohen, Jacob, Statistical Power Analysis for the Behavioral Sciences, 44, (2nd ed. 1988), A-580-876, PRRD 8, bar code 4181776-01 (Nov. 12, 2021) (“Cohen”). Consol. Court No. 15-00213 Page 11 to normal value, instead of comparing the average export prices to normal value. Id. at 27,622–23. In Mid Continent V, the Court of Appeals held that Commerce inadequately explained its choice of the simple average over the weighted average in constructing the Cohens’ d denominator, given that the literature relied upon by Commerce only supports weighted averaging. Mid Continent V, 31 F.4th at 1378–81. Although Commerce is not ordinarily bound to follow published literature, because it justified its methodology by relying on that literature, the Court held that Commerce needed to justify its departure from established statistical practice. 7 Id. As the Court of Appeals explained, Commerce’s methodology must reasonably implement its statutory mandate of determining when prices of certain groups differ significantly. Id. at 1580–81; see also Motor Vehicle Mfrs. Ass’n, 463 U.S. 29, 43; Ceramica Regiomontana, 636 F. Supp. 961, 966. The Court of Appeals concluded that Commerce’s methodology departed from the acknowledged literature. See Mid Continent V, 31 F.4th at 1381. 7 Reviewing the literature, the Court of Appeals referenced affirmative language in Cohen, Coe, and Ellis which supports the use of a weighted average. See Mid Continent V, 31 F.4th at 1378; see also Cohen at 67; Ellis, Paul, The Essential Guide to Effect Sizes: Statistical Power, Meta-Analysis, and the Interpretation of Research Results, 26–27, (2010), A-580-876, PRRD 8, bar code 4181776-01 (Nov. 12, 2021) (“Ellis”); Coe, Robert, It’s the Effect Size: Stupid: What Effect Size Is and Why It Is Important, 6, (September 2002), A-580-876, PRRD 8, bar code 4181776-01 (Nov. 12, 2021) (“Coe”). The Court of Appeals concluded that Commerce had departed from “all the cited statistical literature governing Cohen’s d,” and that “[t]he cited literature nowhere suggests simple averaging for unequal-size groups.” Id. at 1377, 1380. Consol. Court No. 15-00213 Page 12 The Court found insufficient Commerce’s explanations that (i) the pricing behaviors of the test and comparison groups represented “separate and equally rational” and “equally genuine” behavior, and (ii) using the simple average provides greater “predictability” in antidumping determinations. 8 Id. at 1379–80 (discussing Second Remand Results at 8–9). The Court emphasized Commerce’s failure to explain the connection between the rationality and genuineness of a seller’s choices and “the undisputed purpose of the denominator figure.” Id. Relatedly, the Court questioned Commerce’s assertion that the means and variances of both test and comparison groups “constitute an abstract effect” which “exclusively define[s] the independent pricing behavior of each group.” Id. at 1380; Second Remand Results at 45. 9 The Court held Commerce failed to explain why focusing on the difference 8 In context, Commerce explained that: For the purpose of this particular analysis, Commerce finds that these two distinct pricing behaviors [of the test and comparison groups] are separate and equally rational, and each is manifested in the individual prices within each group. Therefore, each warrants an equal weighting when determining the “standard deviation” used to gauge the significance of the difference in the means of the prices of comparable merchandise between these two groups. Because Commerce finds that each of these pricing behaviors are equally genuine when considering the distinct pricing behaviors between a given purchaser, region, or time period and all other sales, an equal weighting is justified when calculating the “standard deviation” of the Cohen’s d coefficient. Second Remand Results at 8–9. 9 The Court concluded that the section of Cohen cited by Commerce for the “abstract effect” proposition did not call for simple averaging, and in any case, related to calculating a different measure of effect size, f, instead of Cohen’s d. Mid Continent V, 31 F.4th at 1380. Consol. Court No. 15-00213 Page 13 between the groups calls for a simple averaging yardstick. Mid Continent V, 31 F.4th at 1380. Further, the Court of Appeals concluded Commerce’s assertion that using the simple average provides greater “predictability” was both unclear and inadequate to support its determination. The Court of Appeals stated it was unclear if Commerce was referring to “predictability” as the ability to predict consequences, and if so, it did not see how Commerce had provided a basis for its assertion of greater predictability, given that weighting is calculated on the basis of product weight, value or units, and not number of transactions. 10 Id. On remand, Commerce abandons its arguments justifying the use of a simple average apart from the literature, and instead argues, again, that the literature supports its methodology. See, e.g., Remand Results at 57 (arguing the choice of a simple average is reasonable because it is supported by the academic literature); id. at 42 (responding to the Court of Appeals’ discussion regarding the qualitative factors and stating “the academic literature does contain support for the use of a simple average”); id. at 42–43 (responding to the Court of Appeals’ direction to explain its choice as reasonable or to reconsider, and explaining that the academic literature supports its choice). Commerce does not further explain how the pricing behaviors of 10 Additionally, the Court of Appeals had already reached the same conclusion regarding predictability in Mid Continent III, when Commerce claimed that simple averaging was more predicable than weighted averaging. See Mid Continent III, 940 F.3d at 674. Consol. Court No. 15-00213 Page 14 the test and comparison groups represent “separate and equally rational” and “equally genuine” behavior. Commerce likewise does not provide additional support for its assertion of “predictability.” On remand, Commerce fails to offer any further explanation for its assertions of predictability, abstract effect, and rationality, observing that “[t]he CAFC has already opined in Mid Continent V that these prior arguments are unpersuasive to support that the simple average is reasonable, and now Commerce has taken a new approach which focusses on the academic literature . . . .” Remand Results at 36. Commerce misconstrues the Court of Appeals’ mandate. The Court held that Commerce’s non-academic arguments were unsupported—not unsupportable. See, e.g., Mid Continent V, 31 F.4th at 1379 (“Commerce has not offered an adequate explanation of why [equal rationality and genuineness] support[] the particular step Commerce must justify”); id. (“And in any event, Commerce has not provided a reasonable explanation for this predictability assertion”). Mid Continent attempts to fill the void left by Commerce and argues that there are practical justifications for using the simple average. For example, Mid Continent argues that the “prevalence and sophistication of many respondents’ ‘dump-proofing’ activities” means that using a weighted average could potentially open the door to manipulation of dumping calculations. Mid Continent’s Cmts. at 11. Yet, even if such explanations could support the reasonableness of Commerce’s choice, they are not Commerce’s explanation, and thus they cannot support its determinations. See Consol. Court No. 15-00213 Page 15 SEC v. Chenery Corp., 332 U.S. 194, 196 (1947) (agency action reviewed on grounds invoked by agency). Instead of attempting to comply with the Court of Appeals’ direction “to provide a reasonable justification for departing from what the acknowledge literature teaches,” Commerce interprets the Court of Appeals’ direction as allowing it to offer an explanation, again, of its view of the literature. Even assuming the Court of Appeals left this option open, Commerce’s arguments fail to support its position. First, Commerce argues that the simple average is supported because it uses the full population of sales, and does not estimate means or standard deviations for the test and comparison groups. Remand Results at 14. Therefore, because the literature only contemplates using the weighted average approach when the standard deviations are estimates, Commerce argues that the simple average is supported, and the weighted average is not. Id. This argument fails for two reasons. First, it is unclear what supports Commerce’s premise that Cohen contemplated using equation (2.3.2), the simple average, with full populations. 11 Cohen’s power tables appear to use sample size as an input, see Cohen at 28, and specify that they are to be used “for power analysis in 11PT argues that section 2.3 of Cohen’s book exclusively concerns sampling from a population. PT’s Cmts. at 9. However, as Defendant argues, PT failed to raise this argument in its comments on the preliminary results, so it has not been exhausted. See 28 U.S.C. § 2637; 19 C.F.R. § 351.309(c)(2). Because the court is remanding the matter for further consideration or explanation, Commerce should address this argument in the first instance on remand. Consol. Court No. 15-00213 Page 16 the case where two samples, each of n cases, have been randomly and independently drawn from normal populations.” Id. at 19. Commerce’s reference to equations (2.2.1) and (2.2.2) as “explicitly” calculating effect size based on actual populations seems inconsistent, given that Cohen used these equations to generate d values to create his power tables, not as stand-alone tests. See id. at 20; Remand Results at 35. A test for full populations in the context of power analysis would be redundant on its face, as there would be no question of statistical significance to analyze. Thus, Commerce does not explain, and it is not discernable why Commerce believes that equations (2.2.1) and (2.2.2)—still less equation (2.3.2), which expressly implicates sample size—are intended for testing full populations. 12 Moreover, the Court of Appeals has already held that the literature does not suggest simple averaging for unequal-sized groups. Mid Continent V, 31 F.4th at 1380. Additionally, even if equation (2.3.2) could be used with full populations, Commerce offers no support for its argument that use of the simple average is reasonable in this context. Rather, Commerce argues that the weighted average would be unreasonable, asserting that the literature discussing weighted averaging is exclusively concerned with sampling. Remand Results at 14; see Cohen at 67. 12Defendant argues that PT failed to exhaust its argument that Cohen does not state equation (2.3.2) applies only to full populations. See Def.’s Reply at 15; PT’s Cmts. at 9. In Mid Continent V, the Court of Appeals discusses the use of equation (2.3.2) with sample groups, rather than full populations, implicitly recognizing that the equation does not apply only to full populations. See Mid Continent V, 31 F.4th at 1378. Therefore, because the Court of Appeals has already addressed this issue, the court need not consider whether PT’s argument has been exhausted. Consol. Court No. 15-00213 Page 17 Commerce also asserts that the differential pricing analysis does not involve sampling, but uses full populations, and thus concludes that weighted averaging is inappropriate in light of this distinction. Remand Results at 14. However, Commerce’s premise does not lead to its conclusion. That weighted averaging is supported when sampling is present does not mean that it is unsupported when sampling is absent. Commerce further claims that it may use equation (2.3.2) regardless of sample size, because sample size is only an important factor in the determination of statistical significance. Id. at 15. In support of this proposition, Commerce references Coe, Cohen, and Ellis, who agree that sample size is a necessary input when calculating pooled standard deviation from sampled data. Remand Results at 15; see also Cohen at 40; Coe at 10; Ellis at 10. Commerce’s argument again fails for two reasons. First, although it is true that sample size is necessary to determine statistical significance, it does not follow that sample size is irrelevant where statistical significance is absent. Commerce explains that Cohen’s d is one of the three variables (d, a, and n) used to determine statistical significance with a t-test. Remand Results at 16. Thus, Commerce argues, Cohen’s warning that power values “may be greatly in error” if both sample sizes and standard deviations are unequal does not apply to the differential pricing analysis, which does not calculate power values. Id. As with its “full population” argument, the fact that Commerce is not conducting a power Consol. Court No. 15-00213 Page 18 analysis does not necessarily mean that it may disregard Cohen’s limitations on equation (2.3.2) for calculating d. Commerce’s assertion that equation (2.5.2) requires estimation from a sample while equation (2.3.2) does not require estimation from a sample, appears inconsistent with the literature. See id. at 15 (citing Cohen at 66–67). Although Commerce identifies ɐ஺ and ɐ஻ in equation (2.3.2) as representing standard deviations of full populations, it fails to consider that the ɐ values themselves seem to be used by Cohen as pre-test estimates of the full population value, which will later be calculated with sampling. See Cohen at 44 (stating that equation (2.3.2) is accurate “provided that sample sizes are about equal”); see also Ellis at 10, 10 n.8 (stating of Cohen’s d test “[i]f [the standard deviations] of both groups are roughly the same then it is reasonable to assume that they are estimating a common population standard deviation”). Thus, Commerce’s assertion that sampling is not implicated in equation (2.3.2) is unsupported, as Cohen seems to use this equation in calculating statistical power. Additionally, Commerce’s assertion that the literature provides no support for the weighted average appears to contradict Cohen, Ellis, and Coe at a number of points, as the Court of Appeals has already observed. See Mid Continent V, 31 F.4th at 1378 (“the cited literature uniformly teaches use of a pooled standard deviation that involves weighted averaging”). For example, Cohen’s power tables use a single n value, representing the size of both samples. See Cohen at 40 (“Sample size, n. Consol. Court No. 15-00213 Page 19 This is the size of each of the two samples being compared”) (emphasis in original). The tables appear to be designed for what Cohen calls “Case 0,” which is when ɐ஺ = ɐ஻ and n஺ = n஻ (i.e., when standard deviations are equal and sample sizes are also equal) See id. at 27. Cohen’s “Case 1” (equation (2.3.1)) and “Case 2” (equation (2.3.2)) represent adaptations to Cohen’s power tables, which allow them to function even when an assumption is not met. See id. at 42–44. As Cohen’s examples following each equation demonstrate, however, all three equations are evidently intended for experimental planning. 13 See id. None of Cohen’s many illustrative examples show using simple averaging with unequal samples. Equation (2.5.2), on the other hand, is used for calculating d from experimental data. See Cohen at 67 (“Here, our focus shifts from research planning to the appraisal of research results, and . . . the palpable characteristics of the sample”). Ellis and Coe both expressly prescribe equation (2.5.2) for situations where effect size is being calculated from experimental data. 14 See Ellis at 10 n.8; Coe at 10. Neither author discusses using a simple, unweighted average. Therefore, as the Court of Appeals found in Mid Continent V, Commerce’s 13 Example 2.3 following equation (2.3.1) shows the power that a psychological experimenter could find in a test, given posited values for a, d, n஺ , and n஻ . Cohen at 43. Example 2.4 is similar, and shows the power an economist could expect from running an experiment with posited d, a, and n values. Id. at 45. These examples show that the power tables in section 2.3 of Cohen’s book allow an experimenter to find any one value of t, d, a, and n, provided that the other three variables are fixed. See id. at 14. Thus, it appears that equation (2.3.2) is used as a tool to estimate d in order to find one of the other three variables in a proposed experiment. 14 Coe and Ellis express pooled standard deviation using different notation than Cohen, but their formulae are algebraically equivalent. Compare Ellis at 10 n.8 and Coe at 10 with Cohen at 67. Consol. Court No. 15-00213 Page 20 claim that academia supports the simple average appears to be contradicted by the literature itself. If Commerce continues to rely on the academic literature to support its methodology, it must further explain why its choice of the simple average is reasonable in light of this inconsistency. The matter is remanded to Commerce for further explanation or reconsideration. CONCLUSION In accordance with the foregoing, it is ORDERED that Commerce’s determination is remanded for further explanation or reconsideration consistent with this opinion; and it is further ORDERED that Commerce shall file its remand redetermination with the court within 90 days of this date; and it is further ORDERED that the parties shall have 30 days to file comments on the remand redetermination; and it is further ORDERED that the parties shall have 30 days to file their replies to the comments on the remand redetermination; and it is further ORDERED that the parties shall file the joint appendix within 14 days after the filing of replies to the comments on the remand redetermination; and it is further ORDERED that Commerce shall file the administrative record within 14 days of the date of filing of its remand redetermination; and it is further Consol. Court No. 15-00213 Page 21 ORDERED that PT’s motion for oral argument, see ECF No. 198, is denied as moot. /s/ Claire R. Kelly Claire R. Kelly, Judge Dated: April 3, 2023 New York, New York