Loading [MathJax]/jax/output/HTML-CSS/jax.js

Reduction of PDF uncertainty in the measurement of the weak mixing angle at the ATLAS experiment

Figures(12) / Tables(11)

Get Citation
Yao Fu, Siqi Yang, Minghui Liu, Liang Han, Tie-Jiun Hou, Carl Schmidt, Chen Wang and C.–P. Yuan. Reduction of PDF uncertainty in the measurement of the weak mixing angle at the ATLAS experiment[J]. Chinese Physics C. doi: 10.1088/1674-1137/abe36d
Yao Fu, Siqi Yang, Minghui Liu, Liang Han, Tie-Jiun Hou, Carl Schmidt, Chen Wang and C.–P. Yuan. Reduction of PDF uncertainty in the measurement of the weak mixing angle at the ATLAS experiment[J]. Chinese Physics C.  doi: 10.1088/1674-1137/abe36d shu
Milestone
Received: 2020-08-14
Article Metric

Article Views(1937)
PDF Downloads(46)
Cited by(0)
Policy on re-use
To reuse of Open Access content published by CPC, for content published under the terms of the Creative Commons Attribution 3.0 license (“CC CY”), the users don’t need to request permission to copy, distribute and display the final published version of the article and to create derivative works, subject to appropriate attribution.
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Email This Article

Title:
Email:

Reduction of PDF uncertainty in the measurement of the weak mixing angle at the ATLAS experiment

  • 1. Department of Modern Physics, University of Science and Technology of China, Jinzhai Road 96, Hefei 230026, China
  • 2. Department of Physics, College of Sciences, Northeastern University, Shenyang 110819, China
  • 3. Department of Physics and Astronomy, Michigan State University, East Lansing, MI 48823, USA

Abstract: We investigate the parton distribution function (PDF) uncertainty in the measurement of the effective weak mixing angle sin2θeff at the CERN Large Hadron Collider (LHC). The PDF-induced uncertainty is large in proton-proton collisions at the LHC due to the dilution effect. The measurement of the Drell-Yan forward-backward asymmetry (AFB) at the LHC can be used to reduce the PDF uncertainty in the sin2θeff measurement. However, when including the full mass range of lepton pairs in the AFB data analysis, the correlation between the PDF updating procedure and the sin2θeff extraction leads to a sizable bias in the obtained sin2θeff value. From our studies, we find that the bias can be significantly reduced by removing Drell-Yan events with invariant mass around the Z-pole region, while most of the sensitivity in reducing the PDF uncertainty remains. Furthermore, the lepton charge asymmetry in the W boson events as a function of the rapidity of the charged leptons, A±(η), is known to be another observable which can be used to reduce the PDF uncertainty in the sin2θeff measurement. The constraint from A±(η) is complementary to that from AFB, and thus no bias affects the sin2θeff extraction. The studies are performed using the error PDF Updating Method Package (ePump), which is based on Hessian updating methods. In this article, the CT14HERA2 PDF set is used as an example.

    HTML

    I.   INTRODUCTION
    • A measurement of the leptonic effective weak mixing angle, θeff, is one of the most important topics in experimental particle physics. It is the key parameter in electroweak global fitting, and played a crucial role in predicting the mass of the Higgs boson with a precision of O(10) GeV. Going forward, it will continue to contribute in global fittings, and will aid in tests of the Standard Model and in searches for potential new physics beyond the Standard Model. At an energy scale of the Z boson mass (MZ), sin2θeff can be determined from measurements of parity violation in the neutral-current processes of fermion-antifermion scattering, fi¯fiZ/γfj¯fj. One such measurement is the forward-backward asymmetry (AFB), defined as

      AFB=NFNBNF+NB,

      (1)

      where NF and NB are the number of forward and backward events, respectively. At lepton colliders, forward and backward events are defined according to the sign of cosθ, where θ is the scattering angle between the outgoing fermion fj and the incoming fermion fi. The most precise determinations to date of sin2θeff at the Z pole are provided by the LEP and SLD collaborations [1], giving a combined result of 0.23153±0.00016. The precisions of these measurements, achieved at the last generation of e+e colliders, are limited by statistical uncertainties. Subsequent to the LEP/SLD era, measurements have been at hadron collider experiments, i.e., the Tevatron proton-antiproton collider, and the Large Hadron Collider (LHC) proton-proton collider, using AFB in the final states of Drell-Yan (DY) pˉp/ppZ/γ+ processes, as a function of the di-lepton pair invariant mass. At hadron colliders, forward and backward events are defined in the Collins-Soper (CS) rest frame [2]. This is a special rest frame of the lepton pair, with the polar and azimuthal angles defined relative to the two hadron beam directions. The z axis is defined in the Z boson rest frame so that it bisects the angle formed by the momentum of either of the incoming hadrons and the negative of the momentum of the other hadron. The cosine of the polar angle θ is defined by the direction of the outgoing lepton l relative to the ˆz axis in the CS frame and can be calculated directly from the laboratory frame lepton quantities by

      cosθCS=c2(p+1p2p1p+2)mllm2ll+p2T,ll,

      (2)

      where the scalar factor c (either 1 or -1) is defined for the Tevatron and the LHC, respectively, as

      c={1,fortheTevatronpZ,ll/|pZ,ll|,fortheLHC.

      (3)

      Thus, the sign of the z axis is defined as the proton beam direction for the Tevatron, and on an event-by-event basis as the sign of the lepton pair momentum with respect to the z axis in the laboratory frame for the LHC. The variables pZ,ll, mll, and pT,ll denote the longitudinal momentum, invariant mass and transverse momentum of the dilepton system, respectively, and,

      p±i=12(Ei±pZ,i),

      (4)

      where the lepton (anti-lepton) energy and longitudinal momentum are E1 and pZ,1 (E2 and pZ,2), respectively. The DY events are therefore defined as forward (cosθCS>0) or backward (cosθCS<0) according to the direction of the outgoing lepton in this frame of reference.

      Compared to lepton colliders, measurements at hadron colliders suffer from additional uncertainties on modeling the directions of the incoming fermions and antifermions in the initial state. Such uncertainties will dilute AFB and reduce the sensitivity for the determination of sin2θeff. The degree of dilution at hadron colliders is modeled by the parton distribution functions (PDFs). At the Tevatron, fermions in the initial state of DY production are dominated by valence quarks. This allows us to make an assumption that the incoming quark of DY production is moving along the proton beam direction, as indicated in Eq. (3), while the direction of the incoming anti-quark is along the anti-proton beam. However, the contribution from sea-quark interactions is still as large as 10% at the Tevatron. The uncertainty of this dilution fraction, which is calculated using PDFs, will propagate into the uncertainty estimation of the sin2θeff measurement extracted from the AFB distribution. The combination of the D0 and CDF measurements at the Tevatron gives a result of 0.23179±0.00030(stat)±0.00017(PDF)±0.00006(syst) [3], which shows a non-negligible PDF-induced uncertainty.

      The PDF dilution effect is even more significant at the LHC, since it is a proton-proton collider. Due to its completely symmetrical initial state, there is an equal probability of finding the incoming quark of DY production from either of the two proton beams. In order to distinguish forward from backward events in pp collisions, the beam pointing to the same hemisphere as the Z boson reconstructed from final state leptons is assumed to be the one which provides the quark. This is motivated by the observation that the valence quarks inside the protons generally carry more energy than the antiquarks (or sea quarks) inside the protons. However, this assignment is only statistically correct, because it is possible for the sea quarks to have a larger fraction of momentum (x) of the incoming proton than the valence quarks. Furthermore, beyond leading order in the QCD interaction, quark-gluon and antiquark-gluon processes will contribute at next-to-leading order (NLO), and the gluon-gluon process will contribute at next-to-NLO (NNLO). These all affect the PDF dilution factor, whose magnitude depends on the precise modeling of the momentum spectra of all flavors of quarks and gluons involved in the Drell-Yan processes, which is more complicated than just modeling the total cross sections of valence quarks and sea quarks for the proton-antiproton case. Consequently, the PDF-induced uncertainty in the AFB measurement at the LHC is significantly larger than that at the Tevatron. The latest published measurement from the CMS collaboration gives a result of 0.23101±0.00036(stat)±0.00031(PDF)±0.00024(syst) [4], in which the PDF uncertainty is about the same size as the statistical uncertainty. Measurements from LHCb and preliminary studies from the ATLAS collaboration using 8 TeV data have similar conclusions [5, 6].

      In the future high luminosity (HL) LHC era, the statistical uncertainty will be reduced as data accumulates. Thus, the PDF uncertainty will become the leading uncertainty that limits the precision in the determination of sin2θeff. Studies have been done in the literature to discuss how to further reduce the PDF uncertainties relevant for precision electroweak measurements at the LHC [7]. Two experimental observables are essential to this task: one is the AFB of the DY pairs and the other is the lepton charge asymmetry A±(η) in the W± boson events. When AFB is used to simultaneously determine sin2θeff and to reduce PDF uncertainties, it will inevitably bring correlations. Such correlations have not been systematically considered in previous studies, as it is not expected to be important when the PDF-induced uncertainty does not dominate the overall uncertainty. In this article, we investigate the correlation between the two tasks of further reducing the PDF uncertainty and performing the precision determination of sin2θeff from measuring the same experimental observable AFB. We demonstrate the potential bias on the sin2θeff determination, and discuss possible solutions for the future LHC measurements. Note that the dilution effect at hadron colliders has a strong dependence on the Z boson rapidity. Therefore, if the weak mixing angle is extracted from the AFB spectrum observed as a function of Z boson rapidity, such as the measurement of the differential cross section of the Drell-Yan process from ATLAS [8], the bias might be different from that in the extraction using average AFB.

      The paper is organized as follows. In Section II, a brief review on using new data to update PDFs and to reduce the related uncertainties is presented. In Section III, we perform an exercise in updating the PDFs with AFB at the LHC, and demonstrate its potential bias on the sin2θeff determination. In Section IV, we study how updating the PDFs with the lepton charge asymmetry A±(η) measured at the LHC reduces the PDF uncertainties. In Section V, we study the implications of updating the PDFs with both AFB and A±(η) data, and apply the error PDF Updating Method Package (ePump) optimization procedure to illustrate the complementary roles of the sideband AFB and A±(η) observables in reducing the PDF uncertainty, and then to make the optimal choice of bin size of the experimental data used in the PDF-updating analysis. Finally, a summary is presented in Section VI.

    II.   PDF UPDATING METHOD AND AFB
    • The two most commonly-used methods for extracting PDFs and their uncertainties from a global analysis of high-energy scattering data are the Monte Carlo method, used by NNPDF [9], and the Hessian method, as used in CT14HERA2 [10, 11]. In the Monte Carlo method, a statistical ensemble of PDF sets are provided, which are assumed to approximate the probability distribution of possible PDFs, as constrained from the global analysis of the data. In the Hessian method, a smaller number of error PDF sets are provided along with the central set which minimizes the χ2-function in a global analysis. These error PDF sets correspond to the positive and negative eigenvector directions in the space of PDF parameters. The most complete method for obtaining constraints from the new data on the PDFs would be to add the new data into the global analysis package and to do a full re-analysis. However, this is impractical for most users of the PDFs. A technique for estimating the impact of new data on the PDFs, without performing a full global analysis, is very useful. In the context of the Monte Carlo PDFs, the PDF reweighting method has become commonplace. This involves applying a weight factor to each of the PDFs in the ensemble [12-14] when performing ensemble averages. The PDF updating procedure will reduce the overall effective number of PDF replica in the ensemble. The impact of new data can also be estimated directly using Hessian PDFs [15-17], where it is called Hessian profiling, updating the eigenvectors within the Hessian approximation, which is faster and simpler. Note that both the Monte Carlo method and Hessian profiling are based on the original Monte Carlo PDFs or error sets, respectively. Therefore, the new data is assumed to be in general consistent with the PDF predictions before updating, so that the updated best-fit PDF set is not too different from the original best fit. If a large deviation is found between the new data and the original theory predictions, a full analysis of PDF global fitting is needed.

      The theoretical predictions in this work are computed using the ResBos [18] package at next-to-leading order (NLO) plus next-to-next-leading log (NNLL) in QCD, in which the canonical scales are used [19, 20]. The CT14HERA2 central and error PDFs [10, 11] are used in this analysis. AFB as a function of dilepton mass (M) at LHC is sensitive both to sin2θeff and to PDF modeling. Fig. 1 shows the AFB distributions of two separated sin2θeff values of 0.2315 and 0.2345, their difference, and the PDF uncertainties as functions of the di-lepton invariant mass for s=13 TeV pp collisions at the LHC. The two values of sin2θeff are arbitrarily chosen to be far separated in order to clearly reveal their different predictions of AFB. When AFB from a new data set is used in the PDF updating procedure, it is assumed to be consistent with the current theory predictions. This means that sin2θeff, on which AFB depends, is considered to have the same value as determined from existing experimental measurements, even if a different value of sin2θeff is used in generating the pseudo-data. As a result, a simple PDF updating procedure will forcibly absorb the difference in sin2θeff into the PDFs, which will bias the determination of both the updated PDFs and the extracted sin2θeff. The size of the bias depends on how large the difference is between the current accepted value of sin2θeff (used in the theory prediction) and the value used in the generation of the pseudo-data, which will be quantitatively discussed in the following sections. An important thing to note is that AFB is more sensitive to sin2θeff in the Z-pole region, while the PDF-induced uncertainty becomes more significant when M moves to higher or lower regions. The difference in sensitivities of the regions suggests a method to reduce the correlations. The work presented in Ref. [7] was done using the Monte Carlo reweighting method, with NNPDF PDFs, and was based on the hypothesis that the above-mentioned correlation is negligible. In this work, we instead use the software package ePump (error PDF Updating Method Package), which can update any given set of Hessian PDFs obtained from an earlier global analysis [21].

      Figure 1.  (color online) AFB theory prediction as a function of M at the 13 TeV LHC. The luminosity assumed in the samples is 130 fb1. No fiducial acceptance selections are applied. The Δ/σ in the middle panel is defined as (AFB[sin2θeff=0.2345]AFB[sin2θeff=0.2315])/σ, where σ is the statistical uncertainty of the event samples in that mass bin. The bottom panel shows the magnitude of PDF-induced uncertainty of AFB, predicted by the CT14HERA2 error PDFs, at the 68% CL.

    III.   UPDATING THE PDFS WITH AFB DATA
    • In this section, we quantitatively examine how the PDF-induced uncertainty in the determination of sin2θeff can be reduced by applying the Hessian updating method, via ePump, and study the correlation mentioned above. First, we consider the case of using the AFB data spanning the full range of M, from 60 to 130 GeV. Second, we consider the case of using only the AFB sideband spectrum, where events with M from 80 to 100 GeV are excluded.

      In order to perform the PDF update, ePump requires two sets of inputs: data templates and theory templates. The data templates provide AFB distributions with their uncertainties. The theory templates consist of the theory predictions for the AFB from the original PDF error sets. The output of ePump consists of an updated central and Hessian eigenvector PDFs, representing the result that would be obtained from a full global re-analysis that includes the new data. As an additional benefit, ePump can also output the updated predictions and uncertainties for any other observables of interest without having to recalculate using the updated PDFs. For more details about the use of ePump, see Ref. [15].

      For the DY samples, each lepton flavor channel-electron and muon - has 250 million events in the mass range of 60 M130 GeV. This sample size corresponds to an integrated luminosity of roughly 130 fb1, which is reasonably close to the total data collected by the ATLAS detector during the LHC Run 2. The pseudo-data is modeled using the CT14HERA2 central PDFs. Nominal theory-template samples consist of the central and (56) error PDF predictions, generated using the CT14HERA2 error PDFs sets. In the theory templates, sin2θeff is set to be 0.2315, which is the value determined by the LEP and SLD collaborations. In the pseudo-data, sin2θeff is set to be 0.2324 in order to examine the effects of an offset or pull in the new data. The difference is deliberately chosen to be three times the uncertainty of the sin2θeff measurement as determined at hadron colliders [3]. To mimic the experimental acceptance, a set of ATLAS detector-like event selections are further applied to the pseudo-data and the nominal theory samples:

      ● Each lepton is required to have transverse momentum pT 25 GeV.

      ● Lepton pseudo-rapidity is limited by |η| < 4.9.

      ● Events are denoted as CC (central-central) if both leptons have |η|2.5, and CF (central-forward) if one lepton has |η|2.5 and the other has 2.5<|η|<4.9. The CC events correspond to doubling the integrated luminosity with respect to the CF events, since both the dielectron and dimuon channels contribute to the CC events, while only the dielectron channel has CF events at the ATLAS detector.

      ● The Z-pole region is defined as dilepton invariant mass satisfying 80M100 GeV. The sideband region is defined as 60<M<80 GeV and 100<M<130 GeV.

      ● The forward-backward asymmetry AFB is measured in a 2 GeV mass bin size.

      Note that the pseudo-data were treated as coming from just one “experiment”, but in practice both ATLAS and CMS would be sources of input data for fitting. For the CMS case, a CF channel measurement would be difficult due to a limited detector acceptance, while the CC events could be fully used as input.

    • A.   Updating PDFs with AFB using the full mass range

    • First, we will update the PDFs using the full mass range AFB. It is expected that the PDF-induced uncertainty on AFB(M) will be reduced after updating the original PDFs with the inclusion of the pseudo-data. Note that the pseudo-data and theory prediction are generated by the same CT14HERA2 PDFs. If the correlation between sin2θeff and the PDF updating is negligible, we expect no changes in the central value of AFB as predicted by the PDFs after updating, compared to that given in the original theory prediction.

      The predicted AFB distributions (sin2θeff=0.2315) and the associated PDF-induced uncertainties, before and after the PDF updating by using the full mass range of the pseudo-data (sin2θeff=0.2324), are depicted in Figs. 2 and 3 for the CC and CF event samples, respectively. As shown in the bottom panels of the figures, the PDF-induced uncertainties on the predicted AFB are significantly reduced after the updating procedure. This finding is consistent with the conclusion of Ref. [7]. However, as also shown in the middle panels of the figures, the central values of AFB differ before and after the updating, particularly in the Z-pole region. The difference is more significant in CF events than for CC. When the sin2θeff of the pseudo-data has a value different from its value in the theory predictions, the existing PDF model (i.e., CT14HERA2 PDFs, in this study) no longer describes the new data in a consistent way. As a result, the PDF updating procedure would forcibly convert this bias, which originated from a different value of sin2θeff, into an updated central PDF set. The averaged AFB values in the Z-pole region in the pseudo-data and theory predictions, before and after PDF updating, are numerically presented in Tables 1, 2 and 3, for the CC, CF and CC+CF events. The CF events have higher sensitivity to the AFB. For example, as given in the third line of Table 2, the PDF uncertainty can be decreased from 0.00118 to 0.00055, a reduction of more than 50%. Meanwhile, the bias on AFB, originating from the PDF updating, can be as large as Δ=0.00108, as shown in the same line of the table. As we pointed out, there should be no difference in the central value of AFB before and after an unbiased updating, because the pseudo-data and theory templates are generated with the same PDF sets. This bias, which is larger than the statistical uncertainty shown in the third column, indicates that much of the effect of the shift in sin2θeff has been absorbed into the updated PDFs.

      Figure 2.  (color online) Predicted AFB distributions (sin2θeff=0.2315) and associated PDF-induced uncertainties, before and after PDF updating using the full mass range of the pseudo-data (sin2θeff=0.2324). Only CC events are considered. Δ/σ in the middle panel is defined as (AFB[before] - AFB[after])/σ, where σ is the statistical uncertainty of the samples in that bin. The bottom panel shows the magnitude of the PDF-induced uncertainty of AFB, predicted by the CT14HERA2 error PDFs, at 68% CL., before and after updating the PDFs.

      Figure 3.  (color online) Similar to Fig. 2, but for CF events only.

      Update using CC with full AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2324
      in CC events0.008250.00008
      in CF events0.039830.00017
      in CC+CF events0.013680.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008670.000190.00008
      Δ[after-before]−0.00006

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.042010.000920.00017
      Δ[after-before]−0.00019

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014400.000310.00007
      Δ[after-before]−0.00009

      Table 1.  (color online) Average AFB in the Z-pole region in the pseudo-data and theory predictions. The PDF updating was done using the full mass range AFB spectrum from the CC events of pseudo-data (sin2θW=0.2324). The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Update using CF with full AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2324
      in CC events0.008250.00008
      in CF events0.039830.00017
      in CC+CF events0.013680.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008560.000260.00008
      Δ[after-before]−0.00017

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.041120.000550.00017
      Δ[after-before]−0.00108

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014160.000320.00007
      Δ[after-before]−0.00033

      Table 2.  Average AFB in the Z-pole region in the pseudo-data and theory predictions. The PDF updating is done using the full mass range AFB spectrum from the CF events of pseudo-data (sin2θeff=0.2324). The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Update using CC + CF with full AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2324
      in CC events0.008250.00008
      in CF events0.039830.00017
      in CC+CF events0.013680.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008530.000180.00008
      Δ[after-before]−0.00020

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.041050.000540.00017
      Δ[after-before]−0.00115

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014110.000250.00007
      Δ[after-before]−0.00038

      Table 3.  Average AFB in the Z-pole region in the pseudo-data (with sin2θW=0.2324) and theory predictions (with sin2θW=0.2315). The PDF updating is done using the full mass range AFB, from the CC, CF or CC+CF events of pseudo-data. The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      To estimate the impact on the determination of sin2θeff in the Z-pole mass region, we express the average AFB approximately as a linear function of sin2θeff in this region, written as

      AFBksin2θeff+b,

      (5)

      where the values of the parameters k and b are listed in Table 4, for CC, CF and CC+CF event samples, respectively. The linear approximation and the values of the slope and offset are acquired using the ResBos [18] generator. The potential bias due to the non-linearity between AFB and sin2θeff, and those higher order corrections, lead to the shift in the central value of sin2θeff to be smaller than 0.00010, and have a negligible effect on the statistical uncertainty estimation [3].

      slope factor koffset factor b
      CC events−0.5310.132
      CF events−2.5120.623
      CC+CF events−1.1100.275

      Table 4.  Simple linear functions between sin2θeff and the observed AFB around the Z pole, predicted by ResBos with CT14NNLO PDFs, for the CC, CF and CC+CF event samples.

      One can roughly estimate the bias and the PDF-induced uncertainty on the determination of sin2θeff, derived from the biased AFB measurement, using the following simplified relation:

      Δsin2θeff=ΔAFB/k.

      (6)

      From the above equation and Table 3, we obtain the results listed in Table 5.

      Update PDF by using full mass range AFBPotential bias on sin2θeffPDF uncertainty on sin2θeffStatistical uncertainty on sin2θeff
      CC events:0.000380.000330.00015
      CF events:0.000460.000210.00007
      CC+CF events:0.000340.000230.00006

      Table 5.  Bias, PDF-induced uncertainty and statistical uncertainty on sin2θeff, after updating the PDFs with the full mass range of the AFB pseudo-data (sin2θeff=0.2324), for the CC and CF event samples, respectively. PDF uncertainties are given at 68% C.L.

      It can be observed that the bias on sin2θeff determined from the biased AFB after the PDF updating is much larger than the PDF-induced uncertainty itself, especially in the CF event sample, which is more sensitive to sin2θeff than the CC event sample. This bias depends on the difference between sin2θeff values in the pseudo-data and the original theory prediction. For this paper, it is intentionally set to an exaggeratedly large difference of 0.0009 for illustration, which is three times the uncertainty obtained from the best hadron collider measurements. A smaller difference between the sin2θeff value of the pseudo-data and the world average value would surely lead to a smaller bias in the AFB measurement after the PDF updating procedure. Nevertheless, this part of our study clearly demonstrates the fact that using the full mass spectrum of the AFB data to update the existing PDFs will introduce bias in the determination of sin2θeff in the Z-pole mass region. With more data collected at the future high luminosity LHC, the weak mixing angle can be determined more precisely, and the sin2θeff measurements with different lepton final states of DY processes at the ATLAS, CMS and LHCb experiments should be considered as separate measurements. Occasionally, one might expect some individual sin2θeff measurements to exhibit significant deviations from the nominal world average value. In that case, the potential bias on the sin2θeff extraction, induced by updating PDFs with the AFB measurement spanning the full mass range, from 60 to 130 GeV, should be accounted for.

      The bias incurred by updating the PDFs using the full mass spectrum can also be observed by looking directly at the PDFs of the quarks and gluons themselves. Figure 4 depicts the comparison of uv, dv, ˉu/uv and ˉd/dv quark PDFs before and after the updating. In Fig. 4, the bias on the d quark PDF is much more significant than that on the u quark PDF. This is caused by the fact that the previous data samples used in the CT14 PDF global fitting provide more constraints on the u quark PDF than the d quark PDF. Therefore, when biased data is introduced, the bias is more easily absorbed into the d quark PDF than the "more fixed" u quark PDF. With an unbiased updating procedure, the central PDF values of the two PDF sets (before and after the PDF updating) should be unchanged, while the updated PDF uncertainties are expected to be reduced after the inclusion of the new pseudo-data. This feature, however, is not confirmed in Fig. 4. Again, this displays how the biased updated PDFs have been changed in order to compensate for the effects of the shifted sin2θeff in the pseudo-data.

      Figure 4.  (color online) Ratios of the central value and uncertainty to the CT14HERA2 central value of the uv, dv, ˉu/uv and ˉd/dv PDFs, before and after the PDF updating. The blue band corresponds to the uncertainty before updating and the red band is after updating. The full mass range of the AFB pseudo-data sample (with sin2θW=0.2324) is used for the updating.

    • B.   Updating PDFs using sideband AFB data only

    • As shown in Fig. 1, the AFB asymmetry is more sensitive to sin2θeff when M is around the Z-pole mass, while the PDF-induced uncertainty becomes more significant when M is outside the Z-pole mass window. This is because in the Z-pole region the asymmetry is proportional to both the vector and axial-vector couplings of the Z boson to the fermions and is numerically close to 0. Since only the vector coupling of the Z boson depends on the weak mixing angle, the information on sin2θeff predominantly comes from AFB in the vicinity of the Z pole. When moving away from the Z boson mass pole, the asymmetry results from the interference of the axial vector Z coupling and vector photon coupling and depends upon the PDFs. On the other hand, the sensitivity of constraining the PDFs via a measurement of AFB depends on the value of the asymmetry (see Appendix A). Consequently, the AFB-to-PDF sensitivity is suppressed in the Z-pole region where the value of the asymmetry is close to zero, and is enhanced outside the Z-pole mass window with magnified AFB value. This observation suggests that we could separate the AFB distribution into Z-pole region and sideband region, and use them for sin2θeff determination and PDF updating procedure, respectively. This procedure could reduce the correlation, and keep most of the sensitivities.

      To confirm this, we generate and use a new pseudo-data sample with a different value of sin2θeff (as = 0.2345) in this section, such that the difference between sin2θeff values in the pseudo-data and the original theory templates is ten times the best precision at hadron colliders. When this new pseudo-data sample was generated, Z-pole events with M from 80 to 100 GeV were explicitly excluded. Following the same analysis procedures as discussed in the previous section, we obtain the numerical results listed in Tables 6, 7 and 8, which summarize the impact of CC, CF and CC+CF events, respectively.

      Update using CC with sideband AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2345
      in CC events0.007140.00008
      in CF events0.034900.00017
      in CC+CF events0.011920.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008720.000240.00008
      Δ[after-before]−0.00001

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.042180.000980.00017
      Δ[after-before]−0.00002

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014480.000360.00007
      Δ[after-before]−0.00001

      Table 6.  Average AFB in the Z-pole region in the pseudo-data and theory predictions. The PDF updating is done using the sideband AFB spectrum from the CC events of pseudo-data (sin2θeff=0.2345). The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Update using CF with sideband AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2345
      in CC events0.007140.00008
      in CF events0.034900.00017
      in CC+CF events0.011920.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008680.000270.00008
      Δ[after-before]−0.00005

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.041720.000730.00017
      Δ[after-before]−0.00048

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014370.000360.00007
      Δ[after-before]−0.00012

      Table 7.  Average AFB in the Z-pole region in the pseudo-data and theory predictions. The PDF updating is done using the sideband AFB spectrum from the CF events of pseudo-data (sin2θeff=0.2345). The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Update using CC + CF with sideband AFBaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      pseudo-data sin2θeff=0.2345
      in CC events0.007140.00008
      in CF events0.034900.00017
      in CC+CF events0.011920.00007

      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008680.000220.00008
      Δ[after-before]−0.00005

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.041730.000720.00017
      Δ[after-before]−0.00047

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014370.000320.00007
      Δ[after-before]−0.00012

      Table 8.  Average AFB in the Z-pole region in the pseudo-data and theory predictions. The PDF updating is done using the sideband AFB spectra from both the CC and CF events of pseudo-data (sin2θeff=0.2345). The statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Since the inclusive production rate of the Z boson is dominated by the contribution from the Z-pole mass window, the constraint on the PDF uncertainty obtained from using only the sideband AFB data sample is not as statistically powerful as that using the full mass range AFB data sample. For example, comparing the sideband result (in Table 8) to the full mass range result (in Table 3), we find that the PDF uncertainty only reduces to 0.00072 for sideband updating, compared to 0.00054 for full mass range updating, in the case of using the most sensitive CF event sample. However, the bias on the average AFB in the Z-pole mass window is much smaller in the sideband updating (with Δ=0.00047 and sin2θeff=0.2345) than that in the full mass range updating (with Δ=0.00115 and sin2θeff=0.2324), as listed in the same tables. Furthermore, in contrast to the strong variation observed in Fig. 4, we find much less bias on various parton flavor PDFs when using only the sideband AFB data to update the PDFs, as shown in Fig. 5. Note that the PDF value is close to zero in the extremely low and high x regions, and there is a lack of data constraints there. This is the dominant reason that the PDF error bands are dramatically larger.

      Figure 5.  (color online) Ratios of the central value and uncertainty to the CT14HERA2 central value of the uv, dv, ˉu/uv and ˉd/dv PDFs, before and after the PDF updating. The blue band corresponds to the uncertainty before updating and the red band is after updating. The sideband mass range of the AFB pseudo-data sample (sin2θeff=0.2345) is used for the updating.

      Using the numbers from Table 8, the impact of updating the PDFs with the sideband AFB data on the determination of sin2θeff is summarized in Table 9. In comparison to the result using the full mass range AFB data in the updating, cf. Table 5, the PDF-induced uncertainty from using only the sideband AFB data sample increases by about 20% ~ 30%, but the biases on sin2θeff diminish dramatically, despite using the much larger value of sin2θeff=0.2345 in the present case. Since the bias introduced by using the full mass range AFB updating is apparently larger than the PDF-induced uncertainties, to reduce the bias on sin2θeff by using the sideband AFB updating should have higher priority than keeping the statistical uncertainty 20% ~ 30% smaller. One should optimize the mass window for a specific measurement to have better balance between bias and sensitivities.

      Update PDF using sideband AFBPotential bias on sin2θeffPDF uncertainty on sin2θeffStatistical uncertainty on sin2θeff
      CC events:0.000090.000410.00015
      CF events:0.000190.000290.00007
      CC+CF events:0.000110.000290.00006

      Table 9.  Bias, PDF-induced uncertainty, and statistical uncertainty on sin2θW, after updating the PDFs with the sideband range of the AFB pseudo-data (with sin2θW=0.2345), for the CC and CF event samples, respectively. The PDF uncertainties are given at 68% C.L.

      Considering the fact that the new data might not correspond to a value of sin2θeff as large as 0.2345, we can conclude that by the end of the LHC Run 2, the potential bias of using the sideband AFB data in the PDF updating should be small. However, this does not mean it can be ignored. As we have seen, by using the sideband AFB in the PDF updating one can reduce the effects of any potential bias, while not significantly enlarging the total uncertainty of the sin2θeff determination. Furthermore, we strongly suggest keeping the PDF updating as a preliminary method to improve the sin2θeff measurement. A final determination of sin2θeff and its uncertainty estimation can only be reliably provided by a full global analysis, which includes new data sets and allows a thorough study on adding new degrees of freedom in the nonperturbative PDF parameters, etc. Experimental results should also be provided in a proper format, allowing theorists to replace the preliminary PDF updating method employed in the experimental measurement by a consistent global analysis.

    IV.   UPDATING PDFS USING LEPTON CHARGE ASYMMETRY A±(η) IN W PRODUCTION
    • In this section, we investigate the advantage of using the asymmetry in the rapidity distribution of the charged leptons from Wlν boson decays, produced at the LHC, to update the PDFs. In pp collisions, W+ and W have different cross sections, and accordingly an asymmetry can be defined as a function of the final state charged lepton rapidity η:

      A±(η)=NW+(η)NW(η)NW+(η)+NW(η).

      (7)

      A measurement of lepton charge assymmetry from the ATLAS collaboration, using 8 TeV data, is published in Ref. [22]. This asymmetry is caused by the difference between up and down type quarks and their anti-quark distributions in the proton, and thus provides complementary information to AFB in constraining the PDFs. Although using A±(η) as input is essential to many other PDF constraints, it has less impact on the sin2θeff measurements, compared to using AFB in the PDF updating. In general, A±(η) is an initial state asymmetry, directly reflecting the difference between W+ and W production rates at the LHC, and has little dependence on the weak interaction decays.

      To study the impact of A±(η) on reducing the PDF-induced uncertainty in the AFB measurement, we generate a set of W boson samples, in which the sin2θeff value is taken to be different from the original theory templates, as done in the previous DY case. To model the ATLAS acceptance, the charged leptons (electrons and muons) from the W boson decay are required to have |η|<2.5. Forward electrons are usually removed from the single W production measurement, due to difficulties in controlling the backgrounds in the high rapidity region. Both charged leptons and neutrinos are required to have pT > 25 GeV. A bin size of 0.1 on |η| is used in the A±(η) distributions. The A±(η) distributions, together with the PDF-induced uncertainties, before and after the PDF updating, are shown in Fig. 6.

      Figure 6.  (color online) A±(η) theory prediction at the 13 TeV LHC with an integrated luminosity of 130 fb1, as a function of the charged lepton rapidity η. Δ/σ in the middle panel is defined as (A±(η)[before] - A±(η)[after])/σ, where σ is the statistical uncertainty of the samples in that bin. The bottom panel shows the magnitude of the PDF-induced uncertainty of A±(η), predicted by the CT14HERA2 error PDFs, at 68% CL., before and after updating the PDFs.

      The values of the average AFB and their PDF-induced uncertainty after updating PDFs with the simulated lepton charge asymmetry pseudo-data are listed in Table 10. The PDF-induced uncertainty on the average AFB is reduced by 17% for CC, and 13% for CF events, after updating PDFs with the A±(η) data. The central prediction for AFB does not change after updating PDFs with the A±(η) data, since there is no direct correlation between the value of sin2θeff and A±(η).

      Update using A±(η) in pseudo-dataaverage AFB at Z polePDF uncertaintyStatistical uncertainty
      theory prediction in CC events, sin2θeff=0.2315
      before update0.008730.000380.00008
      after update0.008730.000310.00008
      Δ[before-after]<0.00001

      theory prediction in CF events, sin2θeff=0.2315
      before update0.042200.001180.00017
      after update0.042190.001030.00017
      Δ[before-after]0.00001

      theory prediction in CC+CF events, sin2θeff=0.2315
      before update0.014490.000530.00007
      after update0.014490.000440.00007
      Δ[before-after]<0.00001

      Table 10.  Average AFB in the Z-pole region before and after the PDF updating. The PDF updating is done using A±(η) from W boson production. Statistical uncertainty corresponds to the data sample with an integrated luminosity of 130 fb1.

      Figure 7 depicts the comparison of d, uvdv, d/u and (uˉddˉu)/(uˉd+dˉu) PDFs, between the nominal CT14HERA2 and the updated PDFs with the inclusion of the A±(η) data. It shows that the potential bias on the central values of the PDFs is negligible, while a noticeable reduction of the PDF uncertainty can be clearly observed in some relevant x ranges, depending on the parton flavor.

      Figure 7.  (color online) Ratios of the central value and uncertainty to the CT14HERA2 central value of the d, uvdv, d/u and (uˉddˉu)/(uˉd+dˉu) PDFs, before and after the PDF updating. The blue band corresponds to the uncertainty before updating and the red band is after updating. The lepton charge asymmetry A±(η) data is used in the updating.

    V.   UPDATING PDFS USING BOTH SIDEBAND AFB AND LEPTON CHARGE ASYMMETRY A±(η)
    • Since the Drell-Yan AFB and the lepton charge asymmetry A±(η) provide complementary information, it is expected that the PDF-induced uncertainty in the determination of sin2θeff can be further reduced if we use both the sideband AFB and the A±(η) data together to update the PDFs. The sin2θeff value in the W and Z pseudo-data samples generation is set to be 0.2345, although for the W samples, ResBos does not use sin2θeff as direct input, and it is thus independent. Applying the same analysis to those two pseudo-data sets, as detailed in the previous sections, we obtain the results listed in Table 11, which should be directly compared to Table 9 for using the sideband AFB data and Table 10 for using the A±(η) data alone. We find that using both data sets to update the PDFs could further reduce the PDF-induced uncertainty on the sin2θeff measurement, which is determined using the AFB data in the Z-pole mass window, by about 28% as compared to that using only the sideband AFB data.

      Update PDF using sideband AFB and A±(η)Potential bias on sin2θeffPDF uncertainty on sin2θeffStatistical uncertainty on sin2θeff
      CC events:0.000090.000320.00015
      CF events:0.000180.000210.00007
      CC+CF events:0.000100.000210.00006

      Table 11.  Bias, PDF-induced uncertainty and statistical uncertainty on sin2θW, after updating the PDFs with the sideband range of the AFB pseudo-data (with sin2θW=0.2345) and the A±(η) pseudo-data samples, for the CC and CF event samples, respectively. The PDF uncertainties are given at 68% C.L.

    VI.   AN APPLICATION OF EPUMP-OPTIMIZATION
    • To further discuss the improvement in PDF-induced uncertainties, we apply the ePump optimization method of the ePump  code in this section, in the combined analysis of the AFB and A±(η) data, (1) to demonstrate their complementary roles in reducing the PDF uncertainty in the PDF-updating procedure; and (2) to investigate the optimal choice of bin size for studying the PDF-induced uncertainty in experimental observables related to those two individual data.

    • A.   Complementary roles in reducing the PDF uncertainty

    • The ePump optimization (or PDF-rediagonalization) method is based on ideas similar to that used in the data set diagonalization method developed by Pumplin [23].

      For a set of new data points, the application constructs an equivalent set of eigenvectors, which are orthogonal to each other in the PDF fitting parameter space, by re-diagonalizing the original Hessian error PDFs with respect to the given data. The total uncertainty calculated by the new eigenvectors is exactly identical to that calculated with the original error PDFs in the linear approximation assumed by the Hessian analysis. However, in addition, the new error PDF pairs are ordered by the magnitudes of their re-calculated eigenvalues, the sum of which should be identical to the total number of the given data points, as noted in Ref. [15]. That is to say that the new eigenvectors can be considered as projecting the original error PDFs to the given data set, and be optimized or re-ordered so that it is easy to choose a reduced set that covers the PDF uncertainty for the input data set to any desired accuracy [15].

      As an example, after applying the ePump optimization method to the CT14HERA2 PDFs for the sideband AFB and A±(η) data sets, which contain 50 data points in total (i.e., 25 bins in each case), we find that the top three new eigenvector pairs predominantly have eigenvalues of 25.2, 18.1 and 5.5, respectively, while the eigenvalues of remaining ones decrease rapidly after that. The combination of these top 3 optimized error PDFs contributes up to 97.6% in the total PDF variance of the 50 given data points. This ePump optimization allows us to conveniently use these 3 leading new eigenvectors, in contrast to applying the full 56 error sets of the CT14HERA2, to study the PDF-induced uncertainty on AFB and A±(η) or any other observable that is directly related to them.

      The relative contributions of the top three leading optimized eigenvectors to the PDF uncertainties of the sideband AFB and A±(η), normalized to each bin for illustration, are shown in Fig. 8. One can see directly that the first eigenvector (labeled as EV01) gives by far the largest contribution to the PDF uncertainties of the sideband AFB, but a very small fraction of the uncertainties of A±(η), particularly for |η|>1. The second and third eigenvectors (labeled as EV02 and EV03) contribute a large or appreciable amount of the uncertainties on A±(η), but a much smaller fraction on AFB. This suggests that when optimizing PDFs using both the AFB and A±(η) samples, these two data sets play complementary roles in reducing the PDF uncertainties, i.e., the re-diagonalization of the first pair of eigenvectors is dominated by the information from the AFB and the second pair has more information from A±(η).

      Figure 8.  (color online) Fractional contribution of the top three leading optimized eigenvectors (EV01, EV02 and EV03) to the variance of the observables AFB and A±(η), normalized to each bin respectively, in the combined ePump optimization analysis.

      The sensitivities provided by the top two pairs of eigenvector PDFs to the different flavor and x-range, probed by the sideband AFB and the lepton charge asymmetry A±(η) together, are depicted in Fig. 9 and Fig. 10, respectively. It can be verified that these two leading pairs of error PDFs, optimized by using both of those two data sets, resemble the respective first pair of eigenvector PDFs after applying the ePump optimization procedure to the sideband AFB and the A±(η) alone. This information can be understood from the following physical argument. AFB is dependent on PDFs predominantly because the dilution effect could lead to an incorrect assignment of the z direction of the Collins-Soper definition. At the LHC, the leading order dilution probability that forward and backward is misjudged depends only on the relative size of PDF ratios u/ˉu and d/ˉd, meaning that it is more sensitive to the quark-antiquark comparison. For A±(η), this asymmetry comes from the difference between the uˉd cross section and the dˉu cross section, meaning that it is more sensitive to the flavor difference. As shown in Fig. 9, the first eigenvector pair, which gives the largest PDF contribution to the AFB uncertainty, dominates the u/ˉu uncertainty in the x region of a Z boson process. The d/ˉd uncertainty is not as dominated by the first eigenvector, because the Z-quark couplings of neutral vector current, which govern the magnitude of AFB at parton level, are proportional to the electric charges of different quark types, so that the sensitivity of AFB to d/ˉd parton distribution is suppressed. Since the observed AFB is a combination of uˉu and dˉd processes, it can provide some information on the difference between u and d quark PDFs, but it is not as sensitive to this as it is to the u/ˉu and d/ˉd ratios. In Fig. 10, the second eigenvector pair, which gives the largest PDF contribution to the A±(η) uncertainty, dominates the d/u and ˉd/ˉu uncertainties in the x region of the single W boson process. However, it has almost no sensitivity to the ˉu/uv uncertainty in the very large x-range.

      Figure 9.  (color online) Ratios of the first pair of eigenvector PDFs and the original CT14HERA2 error PDFs, at Q=100 GeV, to the CT14HERA2 central value of the ˉu/uv, ˉd/dv, d/u and ˉd/ˉu PDFs. Those eigenvector PDFs were obtained after applying the ePump optimization to the original CT14HERA2 PDFs in the combined analysis of the Drell-Yan sideband AFB and the lepton charge asymmetry A±(η) data.

      Figure 10.  (color online) Same as Fig. 9, but for the second pair of eigenvector PDFs.

    • B.   Optimal choice of bin size

    • In previous sections, a bin size of 2 GeV on mass was used for measuring the AFB distribution, and a bin size of 0.1 on η was used for A±(η). In principle, using a large bin size will smear some fine structures of the AFB and A±(η) distributions, and make those observables less sensitive to variations of the PDFs. Hence, it is desirable to determine the maximum allowed bin size without losing sensitivity for a given observable. Due to the difficulty in the experimental unfolding procedure to remove detector effects, such as bin-to-bin migration effects and determination of efficiency and acceptance, it may not always be practical to measure the AFB and A±(η) distributions in such a fine bin configuration. In this section, we discuss how to apply the ePump optimization procedure to obtain the optimal choice of bin size for AFB and A±(η) distributions.

      From Sec. VIA, we learned that the PDF-induced error on AFB and A±(η) can be represented by the leading eigenvectors after ePump optimization. In Fig. 11, we show the AFB distributions, predicted by the first two eigenvector PDF sets, after PDF-rediagonalization. For each eigenvector, positive and negative shifted PDF error sets are compared. Similarly, for the A±(η) distribution, comparisons are shown in Fig. 12.

      Figure 11.  (color online) AFB distribution predicted by the first and second eigenvector PDF sets, after applying the ePump optimization method to optimize the CT14HERA2 PDFs for the AFB data, as described in the text. Predictions from the positive and negative shifted error sets of each eigenvector PDF set are compared, and ΔAFB is their difference.

      Figure 12.  (color online) Similar to Fig. 11, but for A±(η) distribution.

      When the PDFs are varied according to the first pair of eigenvector sets, the most significant change in the shape of the AFB distribution occurs as an oppositely shifted effect in the high mass and low mass regions around the Z pole, cf. the left-hand plot of Fig. 11. Moreover, the shape of Δ is almost flat either below or above the Z-pole mass window, where Δ is the difference between the two values of AFB predicted by the positive and negative shifted error sets. As a result, using a larger bin size on mass will not lose much information on how the PDFs affect the AFB distribution. On the other hand, as shown in the right-hand plot of Fig. 12, when the PDFs are varied according to the second pair of eigenvector sets, the change of Δ in the shape of A±(η) distribution is almost a linear-type. Hence, as long as the bin size on lepton rapidity still reflects the linear shape, the sensitivity of A±(η) to PDF variations should not be dramatically reduced.

      To quantitatively study the sensitivity loss from using a larger bin size, we compare with another analysis done by using a bin size of 5 GeV on mass for the AFB distribution, and a bin size of 0.25 on lepton η for the A±(η) distribution, for the same data samples as used in Section V. We find that numerical calculations using these wide bins give exactly the same results as those presented in the previous tables, which implies that the reduction of PDF uncertainty would not be compromised by using a larger bin size, as proposed above. This leads to a very useful conclusion: aiming for the sin2θeff measurement, both AFB and A±(η) distributions can be measured in a large bin size to reduce systematic uncertainties without losing much sensitivity in constraining the PDFs. This conclusion should hold for both a quick PDF-updating and a full PDF global fitting. This conclusion is important, because as more data accumulates at the LHC, systematic uncertainties will soon be larger than the statistical uncertainty for many precision measurements. Therefore, reducing systematics should be of a higher priority.

    VII.   SUMMARY
    • We have presented a study on how to correctly reduce the PDF-induced uncertainty in the determination of the effective weak mixing angle sin2θeff, obtained from analyzing the measurement of the Drell-Yan forward-backward asymmetry AFB at the LHC. According to previous studies, the PDF-induced uncertainty can be reduced by the PDF updating procedure using AFB. However, when AFB is used for both PDF updating and sin2θeff extraction, the correlation between these two important tasks will cause bias on both the updated PDFs and the extracted value of sin2θeff. Considering the deviation between the previous precise measurements on sin2θeff, such bias could be at the same level as the PDF-induced uncertainty on sin2θeff. In this paper we have shown how this bias can be suppressed. AFB is more sensitive to sin2θeff around the Z pole, while the PDFs affect AFB more significantly in the sideband regions such as 60<Mll<80 GeV and 100<Mll<130 GeV. Accordingly, we propose to use the sideband AFB to reduce the correlation between the sin2θeff extraction and the PDF updating, so that the bias on the sin2θeff determination can be suppressed, while not significantly losing sensitivity in the PDF updating.

      We have applied the ePump  program, based on the Hessian updating method, to update the CT14HERA2 PDFs by including the full mass range AFB pseudo-data as new input to a global PDF fitting. With this updated PDF set, we analyzed the extraction of sin2θeff in the Z-pole mass window and found a sizable bias in its value, with respect to its input value in the pseudo-data. Furthermore, the central values of the updated d and u quark PDFs, obtained from this analysis, are different from those of the original CT14HERA2 PDFs. This is caused by the difference in the sin2θeff values assumed in the pseudo-data and the theory templates. To reduce this type of correlation, we proposed to use only the sideband AFB to update the existing PDFs. As expected, using only the sideband AFB data to update the PDFs reduces the bias on the extraction of sin2θeff value as well as the central values of the updated PDFs. We also show that the asymmetry from W boson decay, A±(η), can be used to further reduce the PDF uncertainty. It plays a complementary role to the sideband AFB data in reducing the PDF-induced uncertainty, with negligible bias on the determination of the weak mixing angle.

      A study on the effect of choosing different bin sizes of the AFB and A±(η) distributions was also performed. It showed that using a somewhat larger bin size will not sacrifice much of the sensitivity of those two observables in reducing the PDF uncertainty in the sin2θeff measurement. When more data are accumulated at the LHC, the systematic uncertainties in the AFB and A±(η) measurements will begin to dominate. In that case, there is an advantage in choosing a larger bin size in order to reduce the systematic uncertainties in the experimental unfolding procedures. In this study, using a bin size of 5 GeV on mass for the AFB distribution, and a bin size of 0.25 on lepton η for the A±(η) distribution, did not cause a noticeable reduction in the sensitivity of these two data sets to the measurement of sin2θeff.

      In conclusion, we have investigated the correlation and potential bias in reducing the PDF-induced uncertainty in the determination of sin2θeff from the forward and backward asymmetry AFB of the Drell-Yan processes at the LHC. Derived from quantitative computation of the Hessian-based ePump  PDF updating program, it can be concluded that by excluding Z-pole region events in the PDF updating, the potential bias on the sin2θeff extraction would not significantly enlarge the estimated total uncertainty, including the statistical and PDF-induced uncertainties at the LHC Run 2. However, the bias is not negligible and thus still needs careful evaluation in future precise sin2θeff measurements at the high luminosity LHC. Moreover, although it is useful to quickly use ePump  to estimate the impact of a new data set on the PDFs, we suggest using the PDF updating method as only a preliminary way to reduce the PDF-induced uncertainty in the sin2θeff measurements. A full PDF global fitting analysis is necessary for a complete determination of sin2θeff with PDF correlations, in which new degrees of freedom in the non-perturbative parametrization of the PDFs can be explored. Furthermore, all experimental results, i.e., AFB and A±(η) studied in this article, should be provided in a format such that theorists can replace the preliminary PDF updating method employed in the experimental analysis by a consistent global analysis.

    ACKNOWLEDGMENTS
    • C.-P. Yuan is grateful for support from the Wu-Ki Tung Endowed Chair in Particle Physics.

    • APPENDIX A: DILUTION EFFECT ON AFB

    • Consider that if the directions of the initial state quarks and antiquarks in the DY events are known, one can define a Collin-Soper frame at hadron colliders without any dilution effect. In this situation, the differential cross section of the DY process can be written as:

      dσqdcosθq(1+cos2θq)+Aq0×12(13cos2θq)+Aq4×cosθq,

      where the label q is used to mark the no-dilution cross section at partonic level. In reality, the dilution effect can lead to an incorrect assignment of the z direction in the Collins-Soper frame, resulting in:

      cosθh=cos(πθq)=cosθq,

      where the label h is used to mark the dilution case at hadronic level. With a dilution probability of f, we have:

      dσhdcosθh=f×dσqdcosθq|cosθq=cosθh+(1f)×dσqdcosθq|cosθq=cosθh(1+cos2θh)+Aq0×12(13cos2θh)

      +(12f)×Aq4×cosθh.

      Accordingly,

      Ah4.=(12f)Aq4

      Since the AFB around the Z pole is proportional to the angular coefficient A4, it turns out that

      AhFB=(12f)AqFB

      The sensitivity of constraining the PDFs (namely constraining f) via AFB depends on the AFB value itself.

Reference (23)

目录

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return