Interactive SuperComputing


 

Success Stories

Computational Photonics Design at
the University of Central Florida


PDF of Case Study

The Challenge

Rigorous Coupled Wave Analysis (RCWA), also known as the Fourier Modal Method (FMM), is a frequency domain, eigenmode expansion, spectral method for rigorously solving Maxwell’s equations of electromagnetism in periodic and artifcially aperiodic, wavelength and subwavelength scale geometries. This fully vectorial computational technique is widely used for modeling the steady-state properties of grating and photonic crystal based wave guiding and diffractive integrated optical devices in both two and three spatial dimensions.

Eigenmode expansion/Scattering Matrix (S-Matrix) techniques are some of the most efficient means of rigorously solving the field and power distributions within certain classes of primarily periodic wavelength scale integrated optical devices, especially when compared to other widely used methods (FDTD, FEM, BPM), but as the geometric complexity and size of a single device increases or if performing geometric optimizations or tolerance studies are of interest, the memory requirements and computation time needed to perform a device study can quickly become impractical for the resources of a single desktop.

In a recent application of this method, researchers at the University of Central Florida ’s College of Optics and Photonics explored using an RCWA/S-Matrix technique with Perfectly Matched Layer (PML) boundary conditions to calculate the power flow throughout a high refractive index contrast photonic-wire S-bend waveguide. The S-bend geometry is approximated with 101 slices, and each layer uses 251 spatial harmonic basis functions to calculate the guided modes and radiation modes of each slice. Consequently, 103 separate layer, complex-valued, asymmetric eigenmode problems of size 251 by 251 are solved and 103 separate individual layer complex-valued scattering matrices of size 251 by 251 are created.

Opportunity for Task-Parallel Computation
This scenario of propagating energy through an asymmetric, non-periodic structure could make use of task parallelism in order to calculate the individual layer eigenproblems and scattering matrices, as well as using a domain decomposition/binary-tree approach to separate the device into separate sections and then processing each section in a relatively independent fashion. In such a setting, the necessary scattering matrix product operations could be performed in parallel for each section.

Furthermore, for performing geometric or spectral parameter scans or optimizations, the task-parallel computations can be integrated to efficiently study the nature of narrow spectral resonances over broadband regions or to perform multi-parameter optimization / evolutionary algorithm studies.

Opportunity for Data-Parallel Computations
Data parallel operations may also become necessary if the number of longitudinal slices in the system is increased, thereby increasing the number of unique eigenmode problems and scattering matrices to be stored. Furthermore, as the transverse size of the computational window is increased, the number of system basis functions also needs to be increased to maintain convergence in the layer eigenmode problems, thereby increasing the storage requirements for all matrices in the system.

Additionally, if the 2D problem is transformed into a 3D problem, by the introduction of a second transverse dimension, the Fourier harmonic grid utilized in each eigenmode problem then becomes a 2D lattice. Depending on the window sizes in each transverse spatial dimension, the total number of basis functions needed to maintain eigenmode convergence may jump from the hundreds into the thousands. Furthermore, the eigenmode problem that is to be solved in the case with two transverse dimensions is a 2Nx2N matrix problem, doubling or quadrupling the storage requirements of every matrix in the system.

In three-dimensional device studies, when hundreds or thousands of basis functions may be needed to obtain convergence in a single layer’s eigenmodes, or for 2D/3D continuously varying longitudinal geometries that may require hundreds of unique individual layer eigenmode problems, individual layer scattering matrices, and intermediary scattering matrices, a distributed memory system must be used to store data that could not otherwise fit within a single desktop’s memory.

Star-P Solution

Together, the data and task parallel capabilities of Star-P can dramatically improve the efficiency and capabilities of an RCWA/S-Matrix computation. In a recent application of Star-P to the RCWA/S-Matrix algorithm, the properties of a novel high refractive index dual grating output coupler were studied extensively. The task parallel properties of Star-P were integrated with an RCWA/S-Matrix code that had previously been written purely in MATLAB®®, reducing run times from 4-6 hours on the desktop, to less than 30 minute runs on a 16-processor server.

Very fine scale geometric parameter scans were performed over large scale parameter ranges that simultaneously show both narrowband and broadband features of the system response. By choosing optimal geometric properties and avoiding potential geometric problems that were discovered, an efficient, all-dielectric, unidirectional, dual grating output coupler was designed. Results from this study can be viewed in the following paper: http://www.opticsexpress.org/abstract.cfm?id=125451

Summary & Metrics

  • Straightforward parallelization of data- and task-parallel computation
  • 11X speed-up on 16-processor server
  • Ability to explore the parameter space and fine-tune the design

11x speed-up on 16-processor server

A similar set of task parallel codes were used to explore the parameter space in the design of multiple wavelength resonant grating filters having broadened angular acceptance at a single angle of incidence. Results from this study can be viewed at the following location:

http://www.opticsexpress.org/abstract.cfm?id=138977

Back to Success Stories