Publications
- Han Tran, Siddhath Saurav, Sandip Mazumder, Ponnuswamy Sadayappan & Hari Sundar (2023). Scalable parallelization for the solution of phonon Boltzmann Transport Equation. ACM International Conference on Supercomputing ICS'23. Published, 06/21/2023.
- Eric Heisler, Aadesh Deshmukh, Sandip Mazumder, Ponnuswamy Sadayappan & Hari Sundar (2023). Multi-discretization domain specific language and code generation for differential equations. Journal of Computational Science. Vol. 68. Published, 04/10/2023.
- Milinda Fernando, David Neilsen, Yosef Zlochower, Eric W. Hirschmann & Hari Sundar (2023). Massively parallel simulations of binary black holes with adaptive wavelet multiresolution. Physical Review D. Vol. 107.
Published, 03/01/2023.
https://journals.aps.org/prd/abstract/10.1103/Phys... - Makrand Khanwale, Kumar Saurabh, Masado Ishii, Hari Sundar, James A Rossmanith & Baskar Ganapathysubramanian (2023). A projection-based, semi-implicit time-stepping approach for the Cahn-Hilliard Navier-Stokes equations on adaptive octree meshes. Journal of Computational Physics. Vol. 475.
Published, 02/01/2023.
https://www.sciencedirect.com/science/article/pii/... - Milinda Fernando, David Neilsen, Eric Hirschmann, Yosef Zlochower, Hari Sundar & George Biros (2022). A GPU-accelerated AMR solver for gravitational wave propagation. ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC22).
Published, 11/15/2022.
https://www.computer.org/csdl/proceedings-article/... - Songzhe Xu, Qiming Zhu, Milinda Fernando & Hari Sundar (2022). A finite element level set method based on adaptive octree meshes for thermal free‐surface flows. International Journal for Numerical Methods in Engineering. Vol. 123(22), 5500-5516.
Published, 11/01/2022.
https://onlinelibrary.wiley.com/doi/abs/10.1002/nm... - Makrand Khanwale, Kumar Saurabh, Milinda Fernando, Victor M Calo, Hari Sundar, James A Rossmanith & Baskar Ganapathysubramanian (2022). A fully-coupled framework for solving Cahn-Hilliard Navier-Stokes equations: Second-order, energy-stable numerical methods on adaptive octree based meshes. Computer Physics Communications. Vol. 280.
Published, 11/01/2022.
https://www.sciencedirect.com/science/article/pii/... - Han Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanaian, Mike Kirby & Hari Sundar (2022). A scalable adaptive-matrix SPMV for heterogeneous architectures. IEEE International Parallel and Distributed Processing Symposium (IPDPS). Published, 07/15/2022.
- Eric Heisler, Aadesh Deshmukh & Hari Sundar (2022). FINCH: Domain Specific Language and Code Generation for Finite Element and Finite Volume in Julia. International Conference on Computational Science (ICCS'22).
Published, 07/15/2022.
https://link.springer.com/chapter/10.1007/978-3-03... - Milinda Fernando & Hari Sundar (2022). Scalable Local Timestepping on Octree Grids. SIAM Journal of Scientific Computing. Vol. 44(2), C156-C183.
Published, 06/01/2022.
https://epubs.siam.org/doi/abs/10.1137/20M136013X - Han D. Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanian, Robert M. Kirby & Hari Sundar (2022). A scalable adaptive-matrix SPMV for heterogeneous architectures. International Symposium on Parallel and Distributed Processing (IPDPS'22).
Published, 05/30/2022.
https://ieeexplore.ieee.org/abstract/document/9820... - Milinda Fernando & Hari Sundar (2022). Scalable Local Timestepping on Octree Grids. SIAM Journal on Scientific Computing. Vol. 44(2), C156-C183.
Published, 04/2022.
https://doi.org/10.1137/20M136013X - Kumar Saurabh, Masado Ishii, Milinda Fernando, Boshun Gao, Kendrick Tan, Ming-Chen Hsu, Adarsh Krishnamurthy, Hari Sundar & Baskar Ganapathysubramanian (2021). Scalable adaptive PDE solvers in arbitrary domains. ACM/IEEE Supercomputing. Published, 11/15/2021.
- Kumar Saurabh, Santi Adavani, Kendrick Tan, Masado Ishii, Boshun Gao, Adarsh Krishnamurthy, Hari Sundar & Baskar Ganapathysubramanian (2021). Case study of SARS-CoV-2 transmission risk assessment in indoor environments using cloud computing resources. ACM/IEEE Supercomputing Workshop. Published, 09/15/2021.
- Kumar Saurabh, Boshun Gao, Milinda Fernando, Songzhe Xu, Makrand A. Khanwale, Biswajit Khara, Ming-Chen Hsu, Adarsh Krishnamurthy, Hari Sundar & Baskar Ganapathysubramanian (2021). Industrial scale Large Eddy Simulations with adaptive octree meshes using immersogeometric analysis. Computers & Mathematics with Applications. Vol. 97, 28-44.
Published, 09/2021.
https://doi.org/10.1016/j.camwa.2021.05.028. - Marek Baranowski, Braden Caywood, Hannah Eyre, Janaan Lake, Kevin Parker, Kincaid Savoie, Hari Sundar & Mary Hall (2018). Reproducing ParConnect for SC16. Parallel Computing. Vol. 70, 18-21.
Published, 12/17/2018.
https://doi.org/10.1016/j.parco.2017.07.004 - Janaan Lake, Qixiang Chao, Hannah Eyre, Emerson Ford, Kevin Parker, Kincaid Savoie, Mary Hall & Hari Sundar (2018). Student Cluster Competition 2017, Team University of Utah: Reproducing Vectorization of the Tersoff Multi-Body Potential on the Intel Broadwell and Intel Skylake Platforms. Parallel Computing. Vol. 79, 1-8.
Published, 11/12/2018.
https://doi.org/10.1016/j.parco.2018.06.011 - Max Carlson & Hari Sundar (2018). Utilizing GPU Parallelism to Improve Fast Spherical Harmonic Transforms. IEEE High Performance extreme Computing Conference (HPEC).
Published, 09/18/2018.
https://ieeexplore.ieee.org/abstract/document/8547... - Nishith Tirpankar & Hari Sundar (2018). Towards Triangle Counting on GPU using Stable Radix binning. IEEE High Performance extreme Computing Conference (HPEC).
Published, 09/18/2018.
https://ieeexplore.ieee.org/document/8547543 - Majid Rasouli, Vidhi Zala, Robert M. Kirby & Hari Sundar (2018). Improving Performance and Scalability of Algebraic Multigrid through a Specialized MATVEC. IEEE High Performance extreme Computing Conference (HPEC).
Published, 09/18/2018.
https://ieeexplore.ieee.org/document/8547580 - Hari Sundar (2017). Efficient Parallel Streaming Algorithms for large-scale Inverse Problems. IEEE High Performance Extreme Computing Conference (HPEC ‘17). Published, 08/28/2017.
- Parmeshwar Khurd & Hari Sundar (2017). Parallel Algorithm for the Computation of Cycles in Relative Neighborhood Graphs. 46th International Conference on Parallel Processing (ICPP).
Published, 06/20/2017.
https://doi.org/10.1109/ICPP.2017.28 - Isuru Fernando, Sanath Jayasena, Milinda Fernando & Hari Sundar (2017). A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters. 46th International Conference on Parallel Processing (ICPP).
Published, 06/20/2017.
https://doi.org/10.1109/ICPP.2017.60 - Milinda Fernando, Dmitry Duplyakin & Hari Sundar (2017). Machine and Application Aware Partitioning for Adaptive Mesh Refinement Applications. Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'17).
Published, 06/05/2017.
https://doi.org/10.1145/3078597.3078610 - FFT, FMM, or Multigrid? A comparative Study of State-Of-the-Art Poisson Solvers for Uniform and Nonuniform Grids in the Unit Cube, Amir Gholami, Dhairya Malhotra, Hari Sundar, and George Biros, SIAM J. Sci. Comput., 38(3), C280–C306.
Published, 06/29/2016.
http://dx.doi.org/10.1137/15M1010798 - Hari Sundar, Omar Ghattas, A Nested Partitioning Algorithm for Adaptive Meshes on Heterogeneous Clusters, Proceedings of the 29th ACM on International Conference on Supercomputing (ICS15), 2015.
Published, 06/22/2015.
http://dx.doi.org/10.1145/2751205.2751246 - Comparison of multigrid algorithms for high-order continuous finite element discretizations H Sundar, G Stadler, G Biros Numerical Linear Algebra with Applications, 22 (4), 664-680.
Published, 04/02/2015.
http://dx.doi.org/10.1002/nla.1979 - Designing Scalable Out-of-core Sorting with Hybrid MPI+ PGAS Programming Models J Jose, S Potluri, H Subramoni, X Lu, K Hamidouche, K Schulz, H Sundar, ... Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models.
Published, 10/06/2014.
http://dl.acm.org/citation.cfm?id=2676880 - Algorithms for high-throughput disk-to-disk sorting H Sundar, D Malhotra, KW Schulz High Performance Computing, Networking, Storage and Analysis (SC), 2013.
Published, 11/17/2013.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum... - HykSort: a new variant of hypercube quicksort on distributed memory architectures H Sundar, D Malhotra, G Biros Proceedings of the 27th international ACM conference on International Conference on Supercomputing.
Published, 06/10/2013.
http://dl.acm.org/citation.cfm?id=2465442 - Parallel geometric-algebraic multigrid on unstructured forests of octrees H Sundar, G Biros, C Burstedde, J Rudi, O Ghattas, G Stadler Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis.
Published, 11/10/2012.
http://dl.acm.org/citation.cfm?id=2389055 - Nonrigid 2D/3D registration of coronary artery models with live fluoroscopy for guidance of cardiac interventions D Rivest-Henault, H Sundar, M Cheriet Medical Imaging, IEEE Transactions on 31 (8), 1557-1572.
Published, 08/2012.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum... - A robust and accurate approach to automatic blood vessel detection and segmentation from angiography x-ray images using multistage random forests V Gupta, A Kale, H Sundar SPIE Medical Imaging, 83152F-83152F-6.
Published, 02/23/2012.
http://proceedings.spiedigitallibrary.org/proceedi... - Coronary arteries motion modeling on 2D x-ray images Y Gao, H Sundar SPIE Medical Imaging, 83161A-83161A-6.
Published, 02/23/2012.
http://proceedings.spiedigitallibrary.org/proceedi... - Global error minimization in image mosaicing using graph connectivity and its applications in microscopy P Khurd, L Grady, R Oketokoun, H Sundar, T Gajera, S Gibbs-Strauss, ... Journal of pathology informatics 2.
Published, 02/2011.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC331271... - Parallel fast Gauss transform RS Sampath, H Sundar, SK Veerapaneni Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
Published, 11/13/2010.
http://dl.acm.org/citation.cfm?id=1884651 - Model-based respiratory motion compensation for image-guided cardiac interventions M Schneider, H Sundar, R Liao, J Hornegger, C Xu Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on.
Published, 06/13/2010.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum... - Automatic global vessel segmentation and catheter removal using local geometry information and vector field integration M Schneider, H Sundar Biomedical Imaging: From Nano to Macro, 2010 IEEE International Symposium on ...
Published, 04/14/2010.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum... - Image-based respiratory motion compensation for fluoroscopic coronary roadmapping Y Zhu, Y Tsin, H Sundar, F Sauer Medical Image Computing and Computer-Assisted Intervention–MICCAI 2010, 287-294.
Published, 01/01/2010.
http://link.springer.com/chapter/10.1007/978-3-642... - An efficient graph-based deformable 2D/3D registration algorithm with applications for abdominal aortic aneurysm interventions R Liao, Y Tan, H Sundar, M Pfister, A Kamen Medical Imaging and Augmented Reality, 561-570.
Published, 01/01/2010.
http://link.springer.com/chapter/10.1007/978-3-642... - Estimating myocardial motion by 4D image warping H Sundar, H Litt, D Shen Pattern recognition 42 (11), 2514-2526.
Published, 11/30/2009.
http://www.sciencedirect.com/science/article/pii/S... - Curve-based 2D-3D registration of coronary vessels for image guided procedure L Duong, R Liao, H Sundar, B Tailhades, A Meyer, C Xu SPIE Medical Imaging, 72610S-72610S-10.
Published, 02/26/2009.
http://proceedings.spiedigitallibrary.org/proceedi... - Automatic image-based cardiac and respiratory cycle synchronization and gating of image sequences H Sundar, A Khamene, L Yatziv, C Xu Medical Image Computing and Computer-Assisted Intervention–MICCAI 2009, 381-388.
Published, 01/01/2009.
http://link.springer.com/chapter/10.1007/978-3-642... - Biomechanically-constrained 4D estimation of myocardial motion H Sundar, C Davatzikos, G Biros Medical Image Computing and Computer-Assisted Intervention–MICCAI 2009, 257-265.
Published, 01/01/2009.
http://link.springer.com/chapter/10.1007/978-3-642... - Dendro: parallel algorithms for multigrid and AMR methods on 2: 1 balanced octrees RS Sampath, SS Adavani, H Sundar, I Lashuk, G Biros Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 18.
Published, 11/15/2008.
http://dl.acm.org/citation.cfm?id=1413389 - Bottom-up construction and 2: 1 balance refinement of linear octrees in parallel H Sundar, RS Sampath, G Biros SIAM Journal on Scientific Computing 30 (5), 2675-2708.
Published, 08/06/2008.
http://epubs.siam.org/doi/abs/10.1137/070681727 - Low-constant parallel algorithms for finite element simulations using linear octrees H Sundar, RS Sampath, SS Adavani, C Davatzikos, G Biros Proceedings of the 2007 ACM/IEEE conference on Supercomputing, 25.
Published, 11/16/2007.
http://dl.acm.org/citation.cfm?id=1362656 - Robust computation of mutual information using spatially adaptive meshes H Sundar, D Shen, G Biros, C Xu, C Davatzikos Medical Image Computing and Computer-Assisted Intervention–MICCAI 2007, 950-958.
Published, 01/01/2007.
http://link.springer.com/chapter/10.1007/978-3-540... - Estimating myocardial fiber orientations by template warping H Sundar, D Shen, G Biros, H Litt, C Davatzikos Biomedical Imaging: Nano to Macro, 2006. 3rd IEEE International Symposium on.
Published, 04/06/2006.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum... - A novel 2D-3D registration algorithm for aligning fluoro images with 3D pre-op CT/MR images H Sundar, A Khamene, C Xu, F Sauer, C Davatzikos Medical Imaging, 61412K-61412K-7.
Published, 03/02/2006.
http://proceedings.spiedigitallibrary.org/proceedi... - Efficient myocyte gene delivery with complete cardiac surgical isolation in situ CR Bridges, K Gopal, DE Holt, C Yarnall, S Cole, RB Anderson, X Yin, ... The Journal of thoracic and cardiovascular surgery 130 (5), 1364. e1-1364. e8.
Published, 11/30/2005.
http://www.sciencedirect.com/science/article/pii/S... - Consistent estimation of cardiac motions by 4D image registration D Shen, H Sundar, Z Xue, Y Fan, H Litt Medical Image Computing and Computer-Assisted Intervention–MICCAI 2005, 902-910.
Published, 01/2005.
http://link.springer.com/chapter/10.1007/11566489_... - Skeleton based shape matching and retrieval H Sundar, D Silver, N Gagvani, S Dickinson Shape Modeling International, 2003, 130-139.
Published, 05/12/2003.
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnum...
Research Keywords
- scientific computing
- Parallel and Distributed Computing
- Parallel Algorithms
- Computational Sciences
Presentations
- Scalable multi-phase flows in complex domains using adaptive octree meshes, Center for Computational Mathematics, Flatiron Institute, New York, NY. Invited Talk/Keynote, Presented, 05/04/2022.
- Scalable two-phase flows in complex domains, Imperial College, London. Invited Talk/Keynote, Presented, 04/25/2022.
- Local timestepping and 4D tree-based adaptivity: Enabling spacetime adaptivity for scalable numerical simulations, Applied Physics Applied Mathematics Colloqium, Columbia University. Invited Talk/Keynote, Presented, 04/05/2022.
- Solving PDEs in space-time: 4D tree-based adaptivity, mesh-free and matrix-free approaches, Numerical Analysis and Scientific Computing Seminar, Courant Institute of Mathematical Sciences, NYU. Invited Talk/Keynote, Presented, 02/18/2022.
- Scalable adaptive PDE solvers in arbitrary domains, Department of Scientific Comput- ing, Florida State University. Invited Talk/Keynote, Presented, 10/20/2021.
- Scalable Space-time adaptivity for Simulations of Binary Black Hole Intermediate-Mass-Ratio-Inspirals, Advances and Challenges in Computational Relativity, ICERM, Brown University.
Invited Talk/Keynote,
Presented, 09/16/2020.
https://icerm.brown.edu/programs/sp-f20/w1/#worksh... - Scalable Space-Time Adaptivity for Simulations of Binary Black Hole Intermediate-Mass-Ratio-Inspirals SIAM PP'20.
Invited Talk/Keynote,
Presented, 02/13/2020.
https://meetings.siam.org/sess/dsp_programsess.cfm... - Scalable Space-time adaptivity for Simulations of Binary Black Hole Intermediate-Mass-Ratio-Inspirals, CS Colloqium, University of Illinois at Urbana-Champagne.
Invited Talk/Keynote,
Presented, 10/09/2019.
https://calendars.illinois.edu/detail/5598?eventId... - Scalable Space-time adaptivity for Simulations of Binary Black Hole Intermediate-Mass-Ratio-Inspirals, Center for Relativistic Astrophysics, Georgia Institute of Technology, Atlanta, GA.
Invited Talk/Keynote,
Presented, 06/05/2019.
https://cra.gatech.edu/event/cra-seminar-hari-sund... - A Scalable Framework for Adaptive Computational General Relativity on Heterogeneous Clusters, Oden Institute, University of Texas at Austin, Austin, TX.
Invited Talk/Keynote,
Presented, 04/25/2019.
https://www.oden.utexas.edu/about/events/1364/ - Scalability & Adaptivity: Achieving Conflicting Goals in a Heterogeneous Computing Era, Mechanical Engineering Seminar, Iowa State University, Ames, IA. Invited Talk/Keynote, Presented, 04/09/2019.
- dendro-GR: Enabling Adaptivity & Parallelism for Computational Relativity, Computational Challenges in Gravitational Wave Astronomy, IPAM, UCLA .
Invited Talk/Keynote,
Presented, 02/01/2019.
https://www.ipam.ucla.edu/programs/workshops/compu... - Parallel Fast Gauss Transform, SIAM Parallel Processing for Scientific Computing, Mar 7 2018, Waseda University, Tokyo, Japan. Invited Talk/Keynote, Presented, 03/07/2018.
- Efficient Parallel Streaming Algorithms for large-scale Inverse Problems - September 13, 2017 - IEEE High Performance Extreme Computing Conference, Waltham, MA. Conference Paper, Refereed, Presented, 09/13/2017.
- Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs - August 16, 2017 - 46th International Conference on Parallel Processing, Bristol, UK. Conference Paper, Refereed, Presented, 08/16/2017.
- A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters, 46th International Conference on Parallel Processing (ICPP). Conference Paper, Refereed, Presented, 08/16/2017.
- Machine and Application Aware Partitioning for Adaptive Mesh Refinement Applications, 26th International Symposium on High-Performance Parallel and Distributed Computing, 2017. Conference Paper, Refereed, Presented, 06/12/2017.
- A Parallel Wavelet Approach for Binary Compact Object Mergers, Hyun Lim, Eric Hirschmann, David Neilsen, William Black, Matthew Anderson, Hari Sundar, Milinda Fernando.
Other,
Presented, 01/27/2017.
http://meetings.aps.org/Meeting/APR17/Session/C5.3 - H-to-P Efficiently: Solving HDG Systems via AMG Within The Nektar++ Framework,
Hari Sundar and Robert Kirby, University of Utah, USA; Spencer Sherwin, Imperial College London, United Kingdom
SIAM Annual Meeting, Boston, MA .
Invited Talk/Keynote,
Presented, 07/11/2016.
http://meetings.siam.org/sess/dsp_programsess.cfm?... - A Nested Partitioning Algorithm for Adaptive Meshes on Heterogeneous Clusters, 29th ACM on International Conference on Supercomputing, Newport Beach, CA. Conference Paper, Refereed, Presented, 06/23/2015.
- Parallel hp-Multigrid for HDG, SIAM Conference on Computational Science and Engineering, Salt Lake City, UT Feb 2015. Invited Talk/Keynote, Presented, 02/23/2015.
- A Nested Partitioning Scheme for Adaptive Meshes on Parallel Heterogeneous Clusters
SIAM Conference on Parallel Processing for Scientific Computing, Portland, OR.
Other,
Presented, 02/20/2014.
http://www.siam.org/meetings/pp14/
Research Groups
- David van Komen, Graduate Student. 01/18/2021 - present.
- LeAnn Leslie, Graduate Student. 08/17/2020 - present.
- Eric Taylor Heisler, Graduate Student. 08/26/2019 - present.
- Songzhe Xu, Postdoc. 06/03/2019 - 10/01/2021.
- Bobby King, Undergraduate Student. 01/07/2019 - 12/23/2019.
- Liam Moynihan, Graduate Student. School of Computing. 01/07/2019 - present.
- Han Duc Tran, Graduate Student. 08/20/2018 - present.
- Masado Ishii, Graduate Student. 08/20/2018 - present.
- Maxx Carlson, Graduate Student. School of Computing. 08/14/2017 - present.
- Weerahannadige Milinda Shyamala Fernando, Graduate Student. 08/17/2015 - 07/30/2021.
- Seyed Majid Rasouli-Pichahi, Graduate Student. 08/17/2015 - 07/30/2021.
- Christopher Mertin, Graduate Student. 08/17/2015 - 07/10/2017.
Languages
- Hindi, fluent.
- Tamil, fluent.
- Urdu, fluent.
Software Titles
- Dendro-GR GPU . A portable, highly-scalable, extensible, and easy-to-use public infrastructure for general relativity simulations that is able to run efficiently on modern GPU clusters. The goal of this work is to perform advanced, massively parallel numerical simulations of IMRIs with mass ratios on the order of 1/100 to generate waveforms that can be used in LIGO data analysis and to calibrate approximate methods. Release Date: 11/16/2021. Inventors: Hari Sundar, Milinda Fernando, David Nielsen.
- Dendro-LTS. Package for locally adaptive time-stepping on octree meshes. This has been extended from previous versions to support GPUs. Release Date: 09/15/2021. Inventors: Milinda Fernando, Hari Sundar.
- Finch. Finch is a Julia-based domain specific language for large-scale PDE simulations. It allows a high level mathematical description of the problem and automatically generates scalble C++ code that can be run efficiently on large HPC clusters. finchdsl.org . Release Date: 09/15/2021. Inventors: Eric Heisler, Hari Sundar .
- UQ-SKetch. Code to compress simulation data on the fly for UQ problems, thereby making the gradient and Hessian computations more efficient. Release Date: 08/18/2020. Inventors: Liam Moynihan, Milinda Fernando, Hari Sundar.
- EigenMM: Scalable Generalized Eigensolvers. We present highly scalable generalized eigensolvers for computing the full spectrum of operators, mainly for the purpose of evaluating fractional operators. Release Date: 08/03/2020. Inventors: Max Carlson, Hari Sundar.
- Saena: Scalable Algebraic Multigrid. Saena is an extremely scalable, large-scale AMG solver and preconditioner for solving large elliptic problems. Release Date: 10/15/2018. Inventors: Majid Rasouli, Hari Sundar.
- DendroGR. A portable, highly-scalable, extensible, and easy-to-use public infrastructure for general relativity simulations that will be forward-compatible with next-generation heterogeneous clusters. The goal of this work is to perform advanced, massively parallel numerical simulations of IMRIs with mass ratios on the order of 1/100 to generate waveforms that can be used in LIGO data analysis and to calibrate approximate methods. Release Date: 07/18/2018. Inventors: Hari Sundar, Milinda Fernando, David Nielsen.
- esort: energy and power efficient sorting on distributed GPU clusters. The package implements communication avoiding distributed sorting algorithms along with highly optimized GPU sorting routines for node-local sorting. The code is highly tuned and provides parallelism using MPI, OpenMP, CUDA and SIMD vectorization. Release Date: 11/16/2015.
- homg. High-order finite-element package using hexahedral elements. The code is a testbed for geometric multigrid approaches for high order discretizations. The current implementation supports setting up a combination of h and p heirarchy. The following smoothers are supported, * Jacobi * Chebyshev-accelerated Jacobi * block Jacobi * Symmetric SOR . Release Date: 02/24/2014. Inventors: Hari Sundar.
- hyksort. Highly scalable distributed sorting and selection library. The package im- plements BitonicSort, MergeSort, SampleSort and HykSort. The code is highly tuned and provides parallelism using MPI, OpenMP and SIMD vectorization. Release Date: 02/02/2014. Inventors: Hari Sundar, Dhairya Malhotra.
- pfgt. A distributed memory implementation of the fast Gauss Transform. Fast adaptive parallel algorithms to compute the sum of N Gaussians at M points using the fast Gauss Transform. We use parallel octrees and a new scheme for translating the plane-waves to efficiently handle non-uniform distributions. Release Date: 02/15/2011.
- Dendro. A C++ library for constructing and balancing octrees in parallel. It also generates hexahedral meshes from the octrees and extends PETSc’s distributed ar- ray framework to support octree-based meshing. Basic routines for solving PDEs on such meshes using the finite element method are also provided. Release Date: 03/16/2009. Inventors: Hari Sundar, Rahul Sampath.