Floating-point arithmetic

Assume we use a binary floating-point arithmetic and that RN is the round-to-nearest function. Also assume that c is a constant or a real function of one or more variables, and that we have at our disposal a correctly rounded implementation of c, say ĉ = RN(c). For evaluating x • c (resp. x/c or c/x), the natural way is to replace it by RN(x • ĉ) (resp. RN(x/ĉ) or RN(ĉ/x)), that is, to call function ĉ and to perform a floatingpoint multiplication or division. This can be generalized to the approximation of n/d by RN(n/ d) and the approximation of n • d by RN(n • d), where n = RN(n) and d = RN(d), and n and d are functions for which we have at our disposal a correctly rounded implementation. We discuss tight error bounds in ulps of such approximations. From our results, one immediately obtains tight error bounds for calculations such as x * pi, ln(2)/x, x/(y + z), (x + y) * z, x/sqrt(y), sqrt(x)/y, (x + y)(z + t), (x + y)/(z + t), (x + y)/(zt), etc. in floatingpoint arithmetic.

show abstract

“…Since c is between 1 and 2, the bound ( 8) is always less than the bound (7). Hence, the bound (7) holds in all cases. We immediately deduce…”

Section: The General Case Ulp(ĉx) Ulp(cx)mentioning

confidence: 96%

“…It is wiser to measure errors in terms of ulps of the exact result instead of ulps of the computed result, because the latter choice could lead to dubious conclusions. The authors of [7,Section 2.5] illustrate this as follows:…”

Section: Introductionmentioning

confidence: 99%

Error in Ulps of the Multiplication or Division by a Correctly-Rounded Function or Constant in Binary Floating-Point Arithmetic

Brisebarre

Muller

Picot

2024

IEEE Trans. Emerg. Topics Comput.

Self Cite

View full text Add to dashboard Cite

show abstract

“…For some structures, exponentials e σj |Sj | can be huge. To bypass computer arithmetic problems [18], we introduced bounded matrices M j = M j e −σj |Sj | and explored them to construct the characteristic function. An example of calculated vertical mode, the corresponding function n 0 (z) and n eff , is given in Fig.…”

Section: B Numerical Algorithmsmentioning

confidence: 99%

“…( 8). Notably, analytic formulas representing G r,s and related double integrals G (k,j) (r,s) for large |r| and |s| can imply floatingnumber-arithmetic-related problems [18] since we must handle very large and small exponentials e ± √ r 2 +s 2 β0z . By treating large and small exponentials separately (i.e., replacing possibly huge matrices M p,j with bounded matrices M p,j ), avoiding division of very large and small numbers, and accounting for further computer-arithmetic problems (such as ε + 1 − 1 ≡ 0 whereas ε + (1 − 1) ≡ ε for |ε| < 10 −16 ), we could use otherwise unavailable large values of D: see, e.g., Fig.…”

Section: B Numerical Algorithmsmentioning

confidence: 99%

Optical Mode Calculation in Large-Area Photonic Crystal Surface-Emitting Lasers

Radziunas,

Kuhn,

Wenzel

et al. 2024

IEEE Photonics J.

View full text Add to dashboard Cite

We discuss algorithms and numerical challenges in constructing and resolving spectral problems for photonic crystal surface-emitting lasers (PCSELs) with photonic crystal layers and large (up to several tens of mm 2 ) emission areas. We show that finite difference schemes created using coarse numerical meshes provide sufficient accuracy for several major (lowest-threshold) modes of particular device designs. Our technique is applied to the example of large-area all-semiconductor PCSELs, showing how it can be used to optimize device performance.

show abstract