publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
-
Learning to accelerate distributed ADMM using graph neural networksHenri Doerks, Paul Häusner, Daniel Hernández Escobar, and 1 more authorLearning for Dynamics & Control (L4DC), 2026Distributed optimization is fundamental in large-scale machine learning and control applications. Among existing methods, the Alternating Direction Method of Multipliers (ADMM) has gained popularity due to its strong convergence guarantees and suitability for decentralized computation. However, ADMM often suffers from slow convergence and sensitivity to hyperparameter choices. In this work, we show that distributed ADMM iterations can be naturally represented within the message-passing framework of graph neural networks (GNNs). Building on this connection, we propose to learn adaptive step sizes and communication weights by a graph neural network that predicts the hyperparameters based on the iterates. By unrolling ADMM for a fixed number of iterations, we train the network parameters end-to-end to minimize the final iterates error for a given problem class, while preserving the algorithm’s convergence properties. Numerical experiments demonstrate that our learned variant consistently improves convergence speed and solution quality compared to standard ADMM. The code is available at https://github.com/paulhausner/learning-distributed-admm.
-
Warm-starting active-set solvers using graph neural networksElla J Schmidtobreick, Daniel Arnström, Paul Häusner, and 1 more authorLearning for Dynamics & Control (L4DC), 2026Quadratic programming (QP) solvers are widely used in real-time control and optimization, but their computational cost often limits applicability in time-critical settings. We propose a learning-to-optimize approach using graph neural networks (GNNs) to predict active sets in the dual active-set solver DAQP. The method exploits the structural properties of QPs by representing them as bipartite graphs and learning to identify the optimal active set for efficiently warm-starting the solver. Across varying problem sizes, the GNN consistently reduces the number of solver iterations compared to cold-starting, while performance is comparable to a multilayer perceptron (MLP) baseline. Furthermore, a GNN trained on varying problem sizes generalizes effectively to unseen dimensions, demonstrating flexibility and scalability. These results highlight the potential of structure-aware learning to accelerate optimization in real-time applications such as model predictive control.
-
Observation-dependent Bayesian active learning via input-warped Gaussian processesSanna Jarl, Maria Bånkestad, Jonathan JS Scragg, and 1 more authorarXiv preprint arXiv:2602.01898, 2026Bayesian active learning relies on the precise quantification of predictive uncertainty to explore unknown function landscapes. While Gaussian process surrogates are the standard for such tasks, an underappreciated fact is that their posterior variance depends on the observed outputs only through the hyperparameters, rendering exploration largely insensitive to the actual measurements. We propose to inject observation-dependent feedback by warping the input space with a learned, monotone reparameterization. This mechanism allows the design policy to expand or compress regions of the input space in response to observed variability, thereby shaping the behavior of variance-based acquisition functions. We demonstrate that while such warps can be trained via marginal likelihood, a novel self-supervised objective yields substantially better performance. Our approach improves sample efficiency across a range of active learning benchmarks, particularly in regimes where non-stationarity challenges traditional methods.
-
Toward Super‐Resolution Reconstruction of Diffusion–Relaxation MRI Using Slice Excitation With Random Overlap (SERO)Felix Mortensen, Jakub Jurek, Jens Sjölund, and 5 more authorsMagnetic Resonance in Medicine, 2026Diffusion MRI probes tissue microstructure, but low SNR and limited resolution hinder detection of features and parameter estimates. We introduce slice excitation with random overlap (SERO), which enables variable repetition times (TRs) and diffusion weighting within a single shot. This acquisition supports super‐resolution reconstruction of baseline signal (S0$ S_0 ), diffusivity (D D ), diffusional variance (V V ), and longitudinal relaxation (T1 T_1 ) maps.We implemented a diffusion‐weighted spin‐echo sequence in Pulseq that excites thick slices at random positions. Across shots, pseudo‐random overlap produces inter‐ and intra‐slice TR variation (0.15–21.9 s) with b‐values up to 1.4 ms/μm2. The T1 T_1 ‐weighting enables through‐slice super‐resolution and allows T1 T_1 $ estimation. Accuracy and precision were evaluated in numerical phantoms …
-
A computational framework for efficient porosity analysis and process parameter optimization in powder bed fusion with laser beamShashank Ramesh Babu, Cole Jetton, Jens Sjölund, and 1 more author2026The fast-paced development of new alloys for metal additive manufacturing (AM) makes it imperative to quickly quantify how material properties depend on the processing parameters. For instance, minimizing the porosity of the printed parts can help reduce corrosion, improve strength, and increase fatigue life. Porosity is also the first property optimized during process parameter development in laser beam powder bed fusion (PBF-LB). This paper demonstrates an approach for efficiently mapping the parameter space of additively manufactured components with a minimal number of samples, using porosity as an example. The workflow consists of two main steps: a neural network for efficient evaluation of the objective and a Gaussian process for an interpretable model of the process-structure-property relationship. Specifically, a convolutional neural network, U-net, segments microscopy images to quickly calculate the porosity without manual evaluation. A Gaussian process then models the porosity as a function of its process parameters, which also allows one to quantify the uncertainty of the porosity throughout the component with minimal samples. This work applies this methodology to WE43, a magnesium alloy of specific interest to biomedical applications, to find the combination of process parameters that minimize the porosity based on the laser power, scan speed, and hatch distance. This research identified a process parameter combination leading to a porosity of 0.06%, and a robust alternative yielding a porosity of 0.07% with reduced standard deviation. Additionally, the paper examines the effect of sample size during model construction …
-
Trials and Tribulations of High-Dimensional Electrolyte Design for Aqueous Zinc Batteries Using Bayesian OptimizationJackie Yik, Lukas Lindén Thöming, Viktor Vanoppen, and 3 more authors2026 -
Exploring Automated Approach and Data-Driven Insights for Optimization of Electrolyte Additive Blends for Li-ion BatteriesJackie Yik, Michael Berg, Lukas Lindén Thöming, and 3 more authors2026 -
A parallel algorithm for generating Pareto-optimal radiosurgery treatment plansJoakim Silva, Daniel Hernández Escobar, Tor Kjellsson Lindblom, and 2 more authorsMedical Physics, 2026Using inverse planning tools to create radiotherapy treatment plans is an iterative process, where clinical trade-offs are explored by changing the relative importance of different objectives and rerunning the optimizer until a desirable plan is found. We seek to optimize hundreds of radiosurgery treatment plans, corresponding to different weightings of objectives, fast enough to incorporate interactive Pareto navigation of clinical trade-offs into the clinical workflow. We apply the alternating direction method of multipliers (ADMM) to the linear-program formulation of the optimization problem used in Lightning. We implement both a CPU and a GPU version of ADMM in Matlab and compare them to Matlab’s built-in, single-threaded dual-simplex solver. The ADMM implementation is adapted to the optimization procedure used in the clinical software, with a bespoke algorithm for maximizing overlap between low-dose points for different objective weights. The method is evaluated on a test dataset consisting of 20 cases from three different indications, with between one and nine targets and total target volumes ranging from 0.66 to 52 cm3, yielding speedups of 1.6-97 and 54-1500 times on CPU and GPU, respectively, compared to simplex. Plan quality was evaluated by rerunning the ADMM optimization 20 times, each with a different random seed, for each test case and for nine objective weightings per case. The resulting clinical metrics closely mimicked those obtained when rerunning the simplex solver, verifying the validity of the method. In conclusion, we show how ADMM can be adapted for radiosurgery plan optimization, allowing hundreds of high …
2025
-
Accelerating aqueous electrolyte design with automated full-cell battery experimentation and Bayesian optimizationJackie T Yik, Carl Hvarfner, Jens Sjölund, and 2 more authorsCell Reports Physical Science, 2025The integration of automation and data-driven methodologies offers a promising approach to accelerating materials discovery in energy storage research. Thus far, in battery research, coin-cell assembly has advanced to become nearly fully automated but remains largely disconnected from data-driven methods. To bridge the disconnect, this work presents a self-driving laboratory framework to accelerate electrolyte discovery by integrating automated coin-cell assembly, galvanostatic cycling of LiFePO4||Li4Ti5O12 organic-aqueous full cells, and Bayesian optimization for selecting subsequent experiments based on prior results. The study explored an organic-aqueous hybrid electrolyte system comprising four co-solvents and two lithium-conducting salts. Using this framework, cells with an optimized electrolyte cycled with at least 94% Coulombic efficiency. Additionally, online electrochemical mass spectrometry …
-
Taming diffusion models for image restoration: a reviewZiwei Luo, Fredrik K Gustafsson, Zheng Zhao, and 2 more authorsPhilosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 2025Diffusion models (DMs) have achieved remarkable progress in generative modelling, particularly in enhancing image quality to conform to human preferences. Recently, these models have also been applied to low-level computer vision for photo-realistic image restoration (IR) in tasks such as image denoising, deblurring and dehazing. In this review, we introduce key constructions in DMs and survey contemporary techniques that make use of DMs in solving general IR tasks. We also point out the main challenges and limitations of existing diffusion-based IR frameworks and provide potential directions for future work.
-
Conditional sampling within generative diffusion modelsZheng Zhao, Ziwei Luo, Jens Sjölund, and 1 more authorPhilosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 2025Generative diffusions are a powerful class of Monte Carlo samplers that leverage bridging Markov processes to approximate complex, high-dimensional distributions, such as those found in image processing and language models. Despite their success in these domains, an important open challenge remains: extending these techniques to sample from conditional distributions, as required in, for example, Bayesian inverse problems. In this paper, we present a comprehensive review of existing computational approaches to conditional sampling within generative diffusion models. Specifically, we highlight key methodologies that either utilize the joint distribution, or rely on (pre-trained) marginal distributions with explicit likelihoods, to construct conditional generative samplers.This article is part of the theme issue ‘Generative modelling meets Bayesian inference: a new paradigm for inverse problems’.
-
Learning incomplete factorization preconditioners for GMRESPaul Häusner, Aleix Nieto Juscafresa, and Jens SjölundIn Northern Lights Deep Learning Conference (NLDL), 2025Incomplete LU factorizations of sparse matrices are widely used as preconditioners in Krylov subspace methods to speed up solving linear systems. Unfortunately, computing the preconditioner itself can be time-consuming and sensitive to hyper-parameters. Instead, we replace the hand-engineered algorithm with a graph neural network that is trained to approximate the matrix factorization directly. To apply the output of the neural network as a preconditioner, we propose an output activation function that guarantees that the predicted factorization is invertible. Further, applying a graph neural network architecture allows us to ensure that the output itself is sparse which is desirable from a computational standpoint. We theoretically analyze and empirically evaluate different loss functions to train the learned preconditioners and show their effectiveness in decreasing the number of GMRES iterations and improving the spectral properties on synthetic data. The code is available at https://github.com/paulhausner/neural-incomplete-factorization.
-
Conditioning diffusion models by explicit forward-backward bridgingAdrien Corenflos, Zheng Zhao, Simo Särkkä, and 2 more authorsIn International Conference on Artificial Intelligence and Statistics (AISTATS), 2025Given an unconditional diffusion model targeting a joint model , using it to perform conditional simulation is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express \emphexact conditional simulation within the \emphapproximate diffusion model as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to implement efficient and principled particle Gibbs and pseudo-marginal samplers marginally targeting the conditional distribution . Contrary to existing methodology, our methods do not introduce any additional approximation to the unconditional diffusion model aside from the Monte Carlo error. We showcase the benefits and drawbacks of our approach on a series of synthetic and real data examples.
-
Machine learning for in-situ composition mapping in a self-driving magnetron sputtering systemSanna Jarl, Jens Sjölund, Robert JW Frost, and 2 more authorsMaterials & design, 2025Self-driving labs (SDLs) employing automation and machine learning (ML) offer great promise for accelerating materials discovery and optimisation. However, in thin film science, SDLs are mainly restricted to solution-based methods which are easier to automate, restricting access to the broader chemical space of inorganic materials. This work advances an SDL based on magnetron co-sputtering, addressing a key challenge: rapidly generating accurate composition maps of multi-element, compositionally graded thin films. Traditional ex-situ methods are slow and error-prone; instead, we present a fast, calibration-free, in-situ ML approach to predict the deposition rate using quartz-crystal microbalance (QCM) sensors. For each sputtering source, deposition rates are sequentially learned as a function of pressure and power via active learning with Gaussian processes (GPs). The final GPs are combined with a …
-
Navigating chemical design spaces for metal-ion batteries via machine-learning-guided phase-field simulationsHamed Taghavian, Viktor Vanoppen, Erik Berg, and 2 more authorsnpj Computational Materials, 2025Metal anodes provide the highest energy density in batteries. However, they still suffer from electrode/electrolyte interface side reactions and dendrite growth, especially under fast-charging conditions. In this paper, we consider a phase-field model of electrodeposition in metal-anode batteries and provide a scalable, versatile framework for optimizing its chemical parameters. Our approach is based on Bayesian optimization and explores the parameter space with a high sample efficiency and a low computation complexity. We use this framework to find the optimal cell for suppressing dendrite growth and accelerating charging speed under constant voltage. We identify interfacial mobility as a key parameter, which should be maximized to inhibit dendrites without compromising the charging speed. The results are verified using extended simulations of dendrite evolution in charging half cells with lithium-metal anodes.
-
Exploring Modularity of Agentic Systems for Drug DiscoveryLaura Weesep, Samuel Genheden, Ola Engkvist, and 1 more authorIn European Conference on Multi-Agent Systems (EUMAS), 2025
Large-language models (LLMs) and agentic systems present exciting opportunities to accelerate drug discovery. In this study, we examine the modularity of LLM-based agentic systems for drug discovery, i.e., whether parts of the system such as the LLM and type of agent are interchangeable, a topic that has received limited attention in drug discovery. We compare the performance of different LLMs and the effectiveness of tool-calling agents versus code-generating agents. Our case study, comparing performance in orchestrating tools for chemistry and drug discovery using an LLM-as-a-judge score, shows that Claude-3.5-Sonnet, Claude-3.7-Sonnet and GPT-4o outperform alternative language models such as Llama-3.1-8B, Llama-3.1-70B, GPT-3.5-Turbo, and Nova-Micro. Although we confirm that code-generating agents outperform the tool-calling ones on average, we show that this is highly question- and model-dependent. Furthermore, the impact of replacing system prompts is dependent on the question and model, underscoring that even in this particular domain one cannot just replace components of the system without re-engineering. Our study highlights the necessity of further research into the modularity of agentic systems to enable the development of reliable and modular solutions for real-world problems.
-
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-EnsemblesRuoqi Zhang, Ziwei Luo, Jens Sjölund, and 3 more authorsIn IEEE Conference on Games (CoG), 2025
Diffusion models have shown impressive performance in capturing complex and multi-modal action distributions for game agents, but their slow inference speed prevents practical deployment in real-time game environments. While consistency models offer a promising approach for one-step generation, they often suffer from training instability and performance degradation when applied to policy learning. In this paper, we present CPQE (Consistency Policy with Q-Ensembles), which combines consistency models with Q-ensembles to address these challenges. CPQE leverages uncertainty estimation through Q-ensembles to provide more reliable value function approximations, resulting in better training stability and improved performance compared to classic double Q-network methods. Our extensive experiments across multiple game scenarios demonstrate that CPQE achieves inference speeds of up to 60 Hz - a …
-
Forward-only diffusion probabilistic modelsZiwei Luo, Fredrik K Gustafsson, Jens Sjölund, and 1 more authorarXiv preprint arXiv:2505.16733, 2025This work presents a forward-only diffusion (FoD) approach for generative modelling. In contrast to traditional diffusion models that rely on a coupled forward-backward diffusion scheme, FoD directly learns data generation through a single forward diffusion process, yielding a simple yet efficient generative framework. The core of FoD is a state-dependent stochastic differential equation that involves a mean-reverting term in both the drift and diffusion functions. This mean-reversion property guarantees the convergence to clean data, naturally simulating a stochastic interpolation between source and target distributions. More importantly, FoD is analytically tractable and is trained using a simple stochastic flow matching objective, enabling a few-step non-Markov chain sampling during inference. The proposed FoD model, despite its simplicity, achieves state-of-the-art performance on various image restoration tasks. Its general applicability on image-conditioned generation is also demonstrated via qualitative results on image-to-image translation. Our code is available at https://github.com/Algolzw/FoD.
-
ESS-Flow: Training-free guidance of flow-based models as inference in source spaceAdhithyan Kalaivanan, Zheng Zhao, Jens Sjölund, and 1 more authorarXiv preprint arXiv:2510.05849, 2025Guiding pretrained flow-based generative models for conditional generation or to produce samples with desired target properties enables solving diverse tasks without retraining on paired data. We present ESS-Flow, a gradient-free method that leverages the typically Gaussian prior of the source distribution in flow-based models to perform Bayesian inference directly in the source space using Elliptical Slice Sampling. ESS-Flow only requires forward passes through the generative model and observation process, no gradient or Jacobian computations, and is applicable even when gradients are unreliable or unavailable, such as with simulation-based observations or quantization in the generation or observation process. We demonstrate its effectiveness on designing materials with desired target properties and predicting protein structures from sparse inter-residue distance measurements.
-
Probabilistic Zeeman-Doppler imaging of stellar magnetic fieldsJR Andersson, O Kochukhov, Z Zhao, and 1 more authorAstronomy & Astrophysics, 2025
-
Symmetrizable systemsHamed Taghavian and Jens SjölundIn European Control Conference (ECC), 2025
Transforming an asymmetric system into a symmetric system makes it possible to exploit the simplifying properties of symmetry in control problems. We define and characterize the family of symmetrizable systems, which can be transformed into symmetric systems by a linear transformation of their inputs and outputs. In the special case of complete symmetry, the set of symmetrizable systems is convex and verifiable by a semidefinite program. We show that a Khatri-Rao rank needs to be satisfied for a system to be symmetrizable and conclude that linear systems are generically neither symmetric nor symmetrizable.
-
Minimal positive Markov realizationsHamed Taghavian and Jens SjölundIn Conference on Decision and Control (CDC), 2025
Finding a positive state-space realization with the minimum dimension for a given transfer function is an open problem in control theory. In this paper, we focus on positive realizations in Markov form and propose a linear programming approach that computes them with a minimum dimension. Such minimum dimension of positive Markov realizations is an upper bound of the minimal positive realization dimension. However, we show that these two dimensions are equal for certain systems.
-
Dose volume histogram optimization based on quantile regressionJens Olof Sjolund and Carl Axel Håkan Nordström2025
Dose-volume criteria may be equivalently expressed in terms of quantiles. This re-formulation of a dose-volume criteria allows incorporation of dose-volume criteria into a full optimization problem. Radiotherapy treatment techniques are described that may apply optimization-based formulation of quantiles to express dose-volume criterion as an inequality involving an optimization problem, which may improve DVH modeling, improve computational speed and accuracy of radiation treatment planning, and improve the delivery accuracy and efficacy of radiation doses to a patient undergoing radiotherapy treatment.
-
Radiotherapy treatment plans using differentiable dose functionsJens Olof Sjolund2025
Techniques for generating a radiotherapy treatment plan parameter are provided. The techniques include receiving radiotherapy treatment plan information; processing the radiotherapy treatment plan information to estimate one or more radiotherapy treatment plan parameters based on a process that depends on the output of a subprocess that estimates a derivative of a dose calculation; and generating a radiotherapy treatment plan using the estimated one or more radiotherapy treatment plan parameters.
-
Methods for adaptive radiotherapyJens Olof Sjolund and Niklas Gunnarsson2025
The present invention relates to the field of radiation therapy and methods, software and systems for adaptive radiotherapy. There is provided a method for adaptive radiotherapy treatment of a patient comprising receiving a sequence of measurement data of the treatment, the measurements being captured at different time points, mapping measurement data to a representation of a treatment geometry at the time points, using the representation in a dynamical model describing how variables in the representation evolve over time, based on the dynamical model, estimating positions over time of the treatment geometry or selected parts of the treatment geometry, determining a treatment action for the patient at a defined time point using the estimation from the dynamical model and executing the treatment action at the defined time point.
-
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via ExplorationAmir Baghi, Jens Sjölund, Joakim Bergdahl, and 2 more authorsarXiv preprint arXiv:2503.13077, 2025
Multi-agent reinforcement learning has shown promise in learning cooperative behaviors in team-based environments. However, such methods often demand extensive training time. For instance, the state-of-the-art method TiZero takes 40 days to train high-quality policies for a football environment. In this paper, we hypothesize that better exploration mechanisms can improve the sample efficiency of multi-agent methods. We propose two different approaches for better exploration in TiZero: a self-supervised intrinsic reward and a random network distillation bonus. Additionally, we introduce architectural modifications to the original algorithm to enhance TiZero’s computational efficiency. We evaluate the sample efficiency of these approaches through extensive experiments. Our results show that random network distillation improves training sample efficiency by 18.8% compared to the original TiZero. Furthermore, we evaluate the qualitative behavior of the models produced by both variants against a heuristic AI, with the self-supervised reward encouraging possession and random network distillation leading to a more offensive performance. Our results highlights the applicability of our random network distillation variant in practical settings. Lastly, due to the nature of the proposed method, we acknowledge its use beyond football simulation, especially in environments with strong multi-agent and strategic aspects.
-
Personalized Privacy Amplification via Importance SamplingDominik Fay, Sebastian Mair, and Jens SjölundTransactions on Machine Learning Research, 2025
We examine the privacy-enhancing properties of importance sampling. In importance sampling, selection probabilities are heterogeneous and each selected data point is weighted by the reciprocal of its selection probability. Due to the heterogeneity of importance sampling, we express our results within the framework of personalized differential privacy. We first consider the general case where an arbitrary personalized differentially private mechanism is subsampled with an arbitrary importance sampling distribution and show that the resulting mechanism also satisfies personalized differential privacy. This constitutes an extension of the established privacy amplification by subsampling result to importance sampling. Then, for any fixed mechanism, we derive the sampling distribution that achieves the optimal sampling rate subject to a worst-case privacy constraint. Empirically, we evaluate the privacy, efficiency, and accuracy of importance sampling on the example of k-means clustering.
-
Porosity predictions in additive manufacturing of Mg alloy WE43 using a Gaussian Process methodologyShashank Ramesh Babu, Jens Sjölund, and Cecilia PerssonIn 34th Annual Conference of the European Society for Biomaterials, Turin, September 7-11, 2025, 2025
2024
-
Controlling Vision-Language Models for Multi-Task Image RestorationZiwei Luo, Fredrik K Gustafsson, Zheng Zhao, and 2 more authorsIn International Conference on Learning Representations (ICLR), 2024Vision-language models such as CLIP have shown great impact on diverse downstream tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such as image restoration their performance deteriorates dramatically due to corrupted inputs. In this paper, we present a degradation-aware vision-language model (DA-CLIP) to better transfer pretrained vision-language models to low-level vision tasks as a multi-task framework for image restoration. More specifically, DA-CLIP trains an additional controller that adapts the fixed CLIP image encoder to predict high-quality feature embeddings. By integrating the embedding into an image restoration network via cross-attention, we are able to pilot the model to learn a high-fidelity image reconstruction. The controller itself will also output a degradation feature that matches the real corruptions of the input, yielding a natural classifier for different degradation types. In addition, we construct a mixed degradation dataset with synthetic captions for DA-CLIP training. Our approach advances state-of-the-art performance on both \emphdegradation-specific and \emphunified image restoration tasks, showing a promising direction of prompting image restoration with large-scale pretrained vision-language models. Our code is available at https://github.com/Algolzw/daclip-uir.
-
Neural incomplete factorization: learning preconditioners for the conjugate gradient methodPaul Häusner, Ozan Öktem, and Jens SjölundTransactions on Machine Learning Research, 2024The convergence of the conjugate gradient method for solving large-scale and sparse linear equation systems depends on the spectral properties of the system matrix, which can be improved by preconditioning. In this paper, we develop a computationally efficient data-driven approach to accelerate the generation of effective preconditioners. We, therefore, replace the typically hand-engineered preconditioners by the output of graph neural networks. Our method generates an incomplete factorization of the matrix and is, therefore, referred to as neural incomplete factorization (NeuralIF). Optimizing the condition number of the linear system directly is computationally infeasible. Instead, we utilize a stochastic approximation of the Frobenius loss which only requires matrix-vector multiplications for efficient training. At the core of our method is a novel message-passing block, inspired by sparse matrix theory, that aligns with the objective of finding a sparse factorization of the matrix. We evaluate our proposed method on both synthetic problem instances and on problems arising from the discretization of the Poisson equation on varying domains. Our experiments show that by using data-driven preconditioners within the conjugate gradient method we are able to speed up the convergence of the iterative procedure. The code is available at https://github.com/paulhausner/neural-incomplete-factorization.
-
Entropy-regularized diffusion policy with q-ensembles for offline reinforcement learningRuoqi Zhang, Ziwei Luo, Jens Sjölund, and 2 more authorsAdvances in Neural Information Processing Systems (NeurIPS), 2024Diffusion policy has shown a strong ability to express complex action distributions in offline reinforcement learning (RL). However, it suffers from overestimating Q-value functions on out-of-distribution (OOD) data points due to the offline dataset limitation. To address it, this paper proposes a novel entropy-regularized diffusion policy and takes into account the confidence of the Q-value prediction with Q-ensembles. At the core of our diffusion policy is a mean-reverting stochastic differential equation (SDE) that transfers the action distribution into a standard Gaussian form and then samples actions conditioned on the environment state with a corresponding reverse-time process. We show that the entropy of such a policy is tractable and that can be used to increase the exploration of OOD samples in offline RL training. Moreover, we propose using the lower confidence bound of Q-ensembles for pessimistic Q-value function estimation. The proposed approach demonstrates state-of-the-art performance across a range of tasks in the D4RL benchmarks, significantly improving upon existing diffusion-based policies. The code is available at https://github. com/ruoqizzz/entropy-offlineRL.
-
Photo-realistic image restoration in the wild with controlled vision-language modelsZiwei Luo, Fredrik K Gustafsson, Zheng Zhao, and 2 more authorsIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Though diffusion models have been successfully applied to various image restoration (IR) tasks their performance is sensitive to the choice of training datasets. Typically diffusion models trained in specific datasets fail to recover images that have out-of-distribution degradations. To address this problem this work leverages a capable vision-language model and a synthetic degradation pipeline to learn image restoration in the wild (wild IR). More specifically all low-quality images are simulated with a synthetic degradation pipeline that contains multiple common degradations such as blur resize noise and JPEG compression. Then we introduce robust training for a degradation-aware CLIP model to extract enriched image content features to assist high-quality image restoration. Our base diffusion model is the image restoration SDE (IR-SDE). Built upon it we further present a posterior sampling strategy for fast noise-free image generation. We evaluate our model on both synthetic and real-world degradation datasets. Moreover experiments on the unified image restoration task illustrate that the proposed posterior sampling improves image generation quality for various degradations.
-
NTIRE 2024 restore any image model (RAIM) in the wild challengeJie Liang, Radu Timofte, Qiaosi Yi, and 45 more authorsIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
In this paper we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation where generative perceptual quality and fidelity are desired in the restoration result. The challenge consisted of two tasks. Task one employed real referenced data pairs where quantitative evaluation is available. Task two used unpaired images and a comprehensive user study was conducted. The challenge attracted more than 200 registrations where 39 of them submitted results with more than 400 submissions. Top-ranked methods improved the state-of-the-art restoration performance and obtained unanimous recognition from all 18 judges. The proposed datasets are available at https://drive. google. com/file/d/1DqbxUoiUqkAIkExu3jZAqoElr_nu1IXb/view? usp= sharing and the homepage of this challenge is at https://codalab. lisn. upsaclay. fr/competitions/17632.
-
Parameter search in radiotherapy treatment plan optimizationJens Olof Sjolund2024
Techniques for generating a radiotherapy treatment plan are provided. The techniques include receiving a radiotherapy optimization problem, the radiotherapy problem comprising a plurality of parameters; processing the radiotherapy optimization problem to instantiate a first set of candidate parameters; converting the first set of candidate parameters into an adapted representation; defining an adapted radiotherapy optimization problem as a function of the adapted representation such that a given solution to the adapted optimization problem estimates a solution to the radiotherapy optimization problem; processing the adapted radiotherapy optimization problem to estimate a function of the solution to the adapted radiotherapy optimization problem; and processing the estimated function of the solution to the adapted optimization problem to generate a deliverable radiotherapy treatment plan.
-
Ising on the graph: Task-specific graph subsampling via the Ising modelMaria Bånkestad, Jennifer R Andersson, Sebastian Mair, and 1 more authorIn Learning on Graphs (LoG), 2024Reducing a graph while preserving its overall properties is an important problem with many applications. Typically, reduction approaches either remove edges (sparsification) or merge nodes (coarsening) in an unsupervised way with no specific downstream task in mind. In this paper, we present an approach for subsampling graph structures using an Ising model defined on either the nodes or edges and learning the external magnetic field of the Ising model using a graph neural network. Our approach is task-specific as it can learn how to reduce a graph for a specific downstream task in an end-to-end fashion without requiring a differentiable loss function for the task. We showcase the versatility of our approach on four distinct applications: image segmentation, explainability for graph classification, 3D shape sparsification, and sparse approximate matrix inverse determination.
-
Archetypal Analysis++: Rethinking the Initialization StrategySebastian Mair and Jens SjölundTransactions on Machine Learning Research, 2024
Archetypal analysis is a matrix factorization method with convexity constraints. Due to local minima, a good initialization is essential, but frequently used initialization methods yield either sub-optimal starting points or are prone to get stuck in poor local minima. In this paper, we propose archetypal analysis++ (AA++), a probabilistic initialization strategy for archetypal analysis that sequentially samples points based on their influence on the objective function, similar to -means++. In fact, we argue that -means++ already approximates the proposed initialization method. Furthermore, we suggest to adapt an efficient Monte Carlo approximation of -means++ to AA++. In an extensive empirical evaluation of 15 real-world data sets of varying sizes and dimensionalities and considering two pre-processing strategies, we show that AA++ almost always outperforms all baselines, including the most frequently used ones.
-
Exploring metal electroplating for energy storage by quartz crystal microbalance: a reviewViktor Vanoppen, Diethelm Johannsmann, Xu Hou, and 3 more authors2024
The development and application of Electrochemical Quartz Crystal Microbalance (EQCM) sensing to study metal electroplating, especially for energy storage purposes, are reviewed. The roles of EQCM in describing electrode/electrolyte interface dynamics, such as the electric double‐layer build‐up, ionic/molecular adsorption, metal nucleation, and growth, are addressed. Modeling of the QCM sensor is introduced and its importance is emphasized. Challenges of metal electrode use, including side reactions and dendrite formation, along with their mitigation strategies are reviewed. Numerous factors affecting the electroplating processes, such as electrolyte composition, additives, temperature, and current density, and their influence on the electroplated metals’ mass, structural, and mechanical characteristics are discussed. Looking forward, the need for deeper fundamental understanding and advancing …
-
Efficient Radiation Treatment Planning based on Voxel ImportanceSebastian Mair, Anqi Fu, and Jens SjölundPhysics in Medicine & Biology, 2024
Objective Radiation treatment planning (RTP) involves optimization over a large number of voxels, many of which carry limited information about the clinical problem. We propose an approach to reduce the large optimization problem by only using a representative subset of informative voxels. This way, we drastically improve planning efficiency while maintaining the plan quality. Approach Within an initial probing step, we pre-solve an easier optimization problem involving a simplified objective from which we derive an importance score per voxel. This importance score is then turned into a sampling distribution, which allows us to subsample a small set of informative voxels using importance sampling. By solving a—now reduced—version of the original optimization problem using this subset, we effectively reduce the problem’s size and computational demands while accounting for regions where …
-
Radiotherapy plan parameters with privacy guaranteesDominik Fay and Jens Olof Sjolund2024
Techniques for producing segmentation with privacy are provided. The techniques include receiving a medical image; processing the medical image with a student machine learning model to estimate radiotherapy plan parameters, the student machine learning model being trained to establish a relationship between a plurality of public training medical images and corresponding radiotherapy plan parameters, the radiotherapy plan parameters of the plurality of public training medical images being generated by aggregating a plurality of radiotherapy plan parameter estimates produced by: processing the plurality of public training medical images with a plurality of teacher machine learning models to generate sets of radiotherapy plan parameter estimates; and reducing respective dimensions of the sets of radiotherapy plan parameter estimates or medical images, the radiotherapy plan parameters of the plurality of …
-
On Feynman–Kac training of partial Bayesian neural networksZheng Zhao, Sebastian Mair, Thomas B Schön, and 1 more authorIn International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Recently, partial Bayesian neural networks (pBNNs), which only consider a subset of the parameters to be stochastic, were shown to perform competitively with full Bayesian neural networks. However, pBNNs are often multi-modal in the latent variable space and thus challenging to approximate with parametric models. To address this problem, we propose an efficient sampling-based training strategy, wherein the training of a pBNN is formulated as simulating a Feynman-Kac model. We then describe variations of sequential Monte Carlo samplers that allow us to simultaneously estimate the parameters and the latent posterior distribution of this model at a tractable computational cost. Using various synthetic and real-world datasets we show that our proposed training scheme outperforms the state of the art in terms of predictive performance.
-
Online learning in motion modeling for intra-interventional image sequencesNiklas Gunnarsson, Jens Sjölund, Peter Kimstrand, and 1 more author2024
Image monitoring and guidance during medical examinations can aid both diagnosis and treatment. However, the sampling frequency is often too low, which creates a need to estimate the missing images. We present a probabilistic motion model for sequential medical images, with the ability to both estimate motion between acquired images and forecast the motion ahead of time. The core is a low-dimensional temporal process based on a linear Gaussian state-space model with analytically tractable solutions for forecasting, simulation, and imputation of missing samples. The results, from two experiments on publicly available cardiac datasets, show reliable motion estimates and an improved forecasting performance using patient-specific adaptation by online learning.
-
Continuum radiotherapy treatment planningJens Olof Sjolund and Carl Axel Håkan Nordström2024
Systems and methods are disclosed for dynamic radiotherapy treatment planning in a continuous space of computation. Example operations for generating treatment plan data for a radiotherapy treatment include: obtaining data for a radiotherapy treatment of a human subject; generating a set of radiation controls from the data for the radiotherapy treatment, with at least one of the radiation controls being based on a mapping from a continuous (eg, infinite dimensional) computational space; converting the generated set of radiation controls to a set of treatment delivery parameters, the set of treatment delivery parameters corresponding to capabilities of a radiotherapy treatment machine; and producing treatment plan data for the radiotherapy treatment based on the set of treatment delivery parameters.
2023
-
Image restoration with mean-reverting stochastic differential equationsZiwei Luo, Fredrik K Gustafsson, Zheng Zhao, and 2 more authorsIn International Conference on Machine Learning (ICML), 2023This paper presents a stochastic differential equation (SDE) approach for general-purpose image restoration. The key construction consists in a mean-reverting SDE that transforms a high-quality image into a degraded counterpart as a mean state with fixed Gaussian noise. Then, by simulating the corresponding reverse-time SDE, we are able to restore the origin of the low-quality image without relying on any task-specific prior knowledge. Crucially, the proposed mean-reverting SDE has a closed-form solution, allowing us to compute the ground truth time-dependent score and learn it with a neural network. Moreover, we propose a maximum likelihood objective to learn an optimal reverse trajectory that stabilizes the training and improves the restoration results. The experiments show that our proposed method achieves highly competitive performance in quantitative comparisons on image deraining, deblurring, and denoising, setting a new state-of-the-art on two deraining datasets. Finally, the general applicability of our approach is further demonstrated via qualitative results on image super-resolution, inpainting, and dehazing. Code is available at https://github.com/Algolzw/image-restoration-sde.
-
Refusion: Enabling large-size realistic image restoration with latent-space diffusion modelsZiwei Luo, Fredrik K Gustafsson, Zheng Zhao, and 2 more authorsIn Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023
This work aims to improve the applicability of diffusion models in realistic image restoration. Specifically, we enhance the diffusion model in several aspects such as network architecture, noise level, denoising steps, training image size, and optimizer/scheduler. We show that tuning these hyperparameters allows us to achieve better performance on both distortion and perceptual scores. We also propose a U-Net based latent diffusion model which performs diffusion in a low-resolution latent space while preserving high-resolution information from the original input for the decoding process. Compared to the previous latent-diffusion model which trains a VAE-GAN to compress the image, our proposed U-Net compression strategy is significantly more stable and can recover highly accurate images without relying on adversarial optimization. Importantly, these modifications allow us to apply diffusion models to various image restoration tasks, including real-world shadow removal, HR non-homogeneous dehazing, stereo super-resolution, and bokeh effect transformation. By simply replacing the datasets and slightly changing the noise network, our model, named Refusion, is able to deal with large-size images (eg, 6000 x 4000 x 3 in HR dehazing) and produces good results on all the above restoration problems. Our Refusion achieves the best perceptual performance in the NTIRE 2023 Image Shadow Removal Challenge and wins 2nd place overall.
-
NTIRE 2023 challenge on stereo image super-resolution: Methods and resultsLongguang Wang, Yulan Guo, Yingqian Wang, and 124 more authorsIn Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023
In this paper, we summarize the 2nd NTIRE challenge on stereo image super-resolution (SR) with a focus on new solutions and results. The task of the challenge is to super-resolve a low-resolution stereo image pair to a high-resolution one with a magnification factor of x4. Compared with single image SR, the major challenge of this challenge lies in how to exploit additional information in another viewpoint and how to maintain stereo consistency in the results. This challenge has 3 tracks, including one track on distortion (eg, PSNR) and bicubic degradation, one track on perceptual quality (eg, LPIPS) and bicubic degradation, as well as another track on real degradations. In total, 175, 93, and 103 participants were successfully registered for each track, respectively. In the test phase, 21, 17, and 12 teams successfully submitted results with PSNR (RGB) scores better than the baseline. This challenge establishes a new benchmark for stereo image SR.
-
Ntire 2023 hr nonhomogeneous dehazing challenge reportCodruta O Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, and 78 more authorsIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
This study assesses the outcomes of the NTIRE 2023 Challenge on Non-Homogeneous Dehazing, wherein novel techniques were proposed and evaluated on new image dataset called HD-NH-HAZE. The HD-NH-HAZE dataset contains 50 high resolution pairs of real-life outdoor images featuring nonhomogeneous hazy images and corresponding haze-free images of the same scene. The nonhomogeneous haze was simulated using a professional setup that replicated real-world conditions of hazy scenarios. The competition had 246 participants and 17 teams that competed in the final testing phase, and the proposed solutions demonstrated the cutting-edge in image dehazing technology.
-
Automated electrolyte formulation and coin cell assembly for high-throughput lithium-ion battery researchJackie T Yik, Leiting Zhang, Jens Sjölund, and 4 more authorsDigital Discovery, 2023
Battery cell assembly and testing in conventional battery research is acknowledged to be heavily time-consuming and often suffers from large cell-to-cell variations. Manual battery cell assembly and electrolyte formulations are prone to introducing errors which confound optimization strategies and upscaling. Herein we present ODACell, an automated electrolyte formulation and battery assembly setup, capable of preparing large batches of coin cells. We demonstrate the feasibility of Li-ion cell assembly in an ambient atmosphere by preparing LiFePO4‖Li4Ti5O12-based full cells with dimethyl sulfoxide-based model electrolyte. Furthermore, the influence of water is investigated to account for the hygroscopic nature of the non-aqueous electrolyte when exposed to ambient atmosphere. The reproducibility tests demonstrate a conservative fail rate of 5%, while the relative standard deviation of the discharge capacity …
-
Lens-to-lens bokeh effect transformation. NTIRE 2023 challenge reportMarcos V Conde, Manuel Kolmet, Tim Seizinger, and 37 more authorsIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
We present the new Bokeh Effect Transformation Dataset (BETD), and review the proposed solutions for this novel task at the NTIRE 2023 Bokeh Effect Transformation Challenge. Recent advancements of mobile photography aim to reach the visual quality of full-frame cameras. Now, a goal in computational photography is to optimize the Bokeh effect itself, which is the aesthetic quality of the blur in out-of-focus areas of an image. Photographers create this aesthetic effect by benefiting from the lens optical properties. The aim of this work is to design a neural network capable of converting the the Bokeh effect of one lens to the effect of another lens without harming the sharp foreground regions in the image. For a given input image, knowing the target lens type, we render or transform the Bokeh effect accordingly to the lens properties. We build the BETD using two full-frame Sony cameras, and diverse lens setups. To the best of our knowledge, we are the first attempt to solve this novel task, and we provide the first BETD dataset and benchmark for it. The challenge had 99 registered participants. The submitted methods gauge the state-of-the-art in Bokeh effect rendering and transformation.
-
Variational Elliptical ProcessesMaria Bånkestad, Jens Sjölund, Jalil Taghia, and 1 more authorTransactions on Machine Learning Research, 2023We present elliptical processes, a family of non-parametric probabilistic models that subsume Gaussian processes and Student’s t processes. This generalization includes a range of new heavy-tailed behaviors while retaining computational tractability. Elliptical processes are based on a representation of elliptical distributions as a continuous mixture of Gaussian distributions. We parameterize this mixture distribution as a spline normalizing flow, which we train using variational inference. The proposed form of the variational posterior enables a sparse variational elliptical process applicable to large-scale problems. We highlight advantages compared to Gaussian processes through regression and classification experiments. Elliptical processes can supersede Gaussian processes in several settings, including cases where the likelihood is non-Gaussian or when accurate tail modeling is essential.
-
Icml 2023 topological deep learning challenge: Design and resultsMathilde Papillon, Mustafa Hajij, Audun Myers, and 49 more authorsIn Topological, Algebraic and Geometric Learning Workshops 2023, 2023
This paper presents the computational challenge on topological deep learning that was hosted within the ICML 2023 Workshop on Topology and Geometry in Machine Learning. The competition asked participants to provide open-source implementations of topological neural networks from the literature by contributing to the python packages TopoNetX (data processing) and TopoModelX (deep learning). The challenge attracted twenty-eight qualifying submissions in its two month duration. This paper describes the design of the challenge and summarizes its main findings.
-
Probabilistic estimation of instantaneous frequencies of chirp signalsZheng Zhao, Simo Särkkä, Jens Sjölund, and 1 more authorIEEE Transactions on Signal Processing, 2023
We present a continuous-time probabilistic approach for estimating the chirp signal and its instantaneous frequency function when the true forms of these functions are not accessible. Our model represents these functions by non-linearly cascaded Gaussian processes represented as non-linear stochastic differential equations. The posterior distribution of the functions is then estimated with stochastic filters and smoothers. We compute a (posterior) Cramér–Rao lower bound for the Gaussian process model, and derive a theoretical upper bound for the estimation error in the mean squared sense. The experiments show that the proposed method outperforms a number of state-of-the-art methods on a synthetic data. We also show that the method works out-of-the-box for two real-world datasets.
-
A tutorial on parametric variational inferenceJens Sjölund2023
Variational inference uses optimization, rather than integration, to approximate the marginal likelihood, and thereby the posterior, in a Bayesian model. Thanks to advances in computational scalability made in the last decade, variational inference is now the preferred choice for many high-dimensional models and large datasets. This tutorial introduces variational inference from the parametric perspective that dominates these recent developments, in contrast to the mean-field perspective commonly found in other introductory texts.
-
Radiotherapy treatment plan optimization using machine learningJonas Anders Adler and Jens Olof Sjölund2023
-
Adaptive hyperparameter selection for differentially private gradient descentDominik Fay, Sindri Magnússon, Jens Sjölund, and 1 more authorTransactions on Machine Learning Research, 2023
We present an adaptive mechanism for hyperparameter selection in differentially private optimization that addresses the inherent trade-off between utility and privacy. The mechanism eliminates the often unstructured and time-consuming manual effort of selecting hyperparameters and avoids the additional privacy costs that hyperparameter selection otherwise incurs on top of that of the actual algorithm. We instantiate our mechanism for noisy gradient descent on non-convex, convex and strongly convex loss functions, respectively, to derive schedules for the noise variance and step size. These schedules account for the properties of the loss function and adapt to convergence metrics such as the gradient norm. When using these schedules, we show that noisy gradient descent converges at essentially the same rate as its noise-free counterpart. Numerical experiments show that the schedules consistently perform well across a range of datasets without manual tuning.
-
Risk-sensitive Actor-free Policy via Convex OptimizationRuoqi Zhang and Jens SjölundIn IJCAI AISafety and SafeRL Joint Workshop, 2023
Traditional reinforcement learning methods optimize agents without considering safety, potentially resulting in unintended consequences. In this paper, we propose an optimal actor-free policy that optimizes a risk-sensitive criterion based on the conditional value at risk. The risk-sensitive objective function is modeled using an input-convex neural network ensuring convexity with respect to the actions and enabling the identification of globally optimal actions through simple gradient-following methods. Experimental results demonstrate the efficacy of our approach in maintaining effective risk control.
-
Exploration of Pareto-optimal radiotherapy plansJens Olof Sjölund and Carl Axel Håkan Nordström2023
Systems and methods are disclosed for exploration and adaptation of radiotherapy treatment plans. Example operations for radiotherapy treatment planning include: obtaining a plurality of solutions (eg, Pareto-optimal solutions) of a radiotherapy problem, exploring the plurality of solutions to identify an additional solution in a submanifold space (eg, exploration of a Pareto surface), and generating treatment plan parameters based on the additional solution for use in a radiation therapy treatment. In an example, exploring the plurality of solutions includes: establishing a submanifold space from a manifold space representing the plurality of solutions in fewer dimensions than the weights; producing additional sets of weights in the submanifold space based on derivatives of first-order optimality conditions of the radiotherapy problem, the derivatives determined with respect to the weights; and navigating in the …
-
Parallel generation of Pareto optimal radiotherapy plansJens Olof Sjölund and Carl Axel Håkan Nordström2023
APTZNLHMIGJTEW-UHFFFAOYSA-N pyraflufen-ethyl Chemical compound C1= C (Cl) C (OCC (= O) OCC)= CC (C= 2C (= C (OC (F) F) N (C) N= 2) Cl)= C1F APTZNLHMIGJTEW-UHFFFAOYSA-N 0.000 description 1
-
Generative model of phase spaceJens Olof Sjolund and Carl Axel Håkan Nordström2023
-
Inferring clinical preferences from dataJens Olof Sjolund2023
-
Treatment planningJens Olof Sjolund, Carl Axel Håkan Nordström, and John Henry Dahlberg2023
-
Diffusion-Based 3D Motion Estimation from Sparse 2D ObservationsNiklas Gunnarsson, Thomas B Schön, Jens Sjölund, and 1 more author2023
Intra-interventional imaging is a tool for monitoring and guiding ongoing treatment sessions. Ideally one would like the full 3D image at high temporal resolution, this is however not possible due to the acquisition time. In this study, we consider the scenario when the observations are sparse and consist only of 2D image slices through the 3D volume. Given 2D-2D image registrations between a predefined 3D volume and the observations, we propose a method to estimate the full 3D motion. This 3D motion enables the reconstruction of the 3D anatomy. Our method relies on a conditioning-based denoising diffusion model and generates estimates given the 2D sparse observations. We reduce the dimensionality of the diffusion process by embedding the data in a lower dimensional space using principal component analysis. The model is evaluated in two experiments: first on synthetically generated data and then using medical lung images. Our observations show that the estimates are stable across the entire volume and within 1 mm of the lower bound defined by the reconstruction error.
2022
-
Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learningAlessa Hering, Lasse Hansen, Tony CW Mok, and 50 more authorsIEEE Transactions on Medical Imaging, 2022
Image registration is a fundamental medical image analysis task, and a wide variety of approaches have been proposed. However, only a few studies have comprehensively compared medical image registration approaches on a wide range of clinically relevant tasks. This limits the development of registration methods, the adoption of research advances into practice, and a fair benchmark across competing approaches. The Learn2Reg challenge addresses these limitations by providing a multi-task medical image registration data set for comprehensive characterisation of deformable registration algorithms. A continuous evaluation will be possible at https://learn2reg.grand-challenge.org. Learn2Reg covers a wide range of anatomies (brain, abdomen, and thorax), modalities (ultrasound, CT, MR), availability of annotations, as well as intra- and inter-patient registration evaluation. We established an easily accessible …
-
Graph-based neural acceleration for nonnegative matrix factorizationJens Sjölund and Maria BånkestadarXiv preprint arXiv:2202.00264, 2022
We describe a graph-based neural acceleration technique for nonnegative matrix factorization that builds upon a connection between matrices and bipartite graphs that is well-known in certain fields, e.g., sparse linear algebra, but has not yet been exploited to design graph neural networks for matrix computations. We first consider low-rank factorization more broadly and propose a graph representation of the problem suited for graph neural networks. Then, we focus on the task of nonnegative matrix factorization and propose a graph neural network that interleaves bipartite self-attention layers with updates based on the alternating direction method of multipliers. Our empirical evaluation on synthetic and two real-world datasets shows that we attain substantial acceleration, even though we only train in an unsupervised fashion on smaller synthetic instances.
-
Compressing radiotherapy treatment plan optimization problemsJens Olof Sjölund2022
Woessner, PA (57) ABSTRACT Techniques for solving a radiotherapy treatment plan opti mization problem are provided. The techniques include receiving a first radiotherapy treatment plan optimization problem having a first set of parameters; processing the first set of parameters to estimate a second set of parameters of a second radiotherapy treatment plan optimization problem; generating a solution to the second radiotherapy treatment plan optimization problem based on the estimated second set of parameters; and generating a radiotherapy treatment plan based on the solution to the second radiotherapy treatment plan optimization problem.
-
Private learning via knowledge transfer with high-dimensional targetsDominik Fay, Jens Sjölund, and Tobias J OechteringIn ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
Preventing unintentional leakage of information about the training set has high relevance for many machine learning tasks, such as medical image segmentation. While differential privacy (DP) offers mathematically rigorous protection, the high output dimensionality of segmentation tasks prevents the direct application of state-of-the-art algorithms such as Private Aggregation of Teacher Ensembles (PATE). In order to alleviate this problem, we propose to learn dimensionality-reducing transformations to map the prediction target into a bounded lower-dimensional space to reduce the required noise level during the aggregation stage. To this end, we assess the suitability of principal component analysis (PCA) and autoencoders. We conclude that autoencoders are an effective means to reduce the noise in the target variables.
-
Generation of realizable radiotherapy plansJens Olof Sjölund and Jonas Anders Adler2022
Int. Ci. A6IN 5/10(2006.01) G16H 20/40(2018.01) GOON 20/00(2019.01)(52) US Ci. CPC A61N 5/103 (2013.01); A61N 5/1031 (2013.01); A61N 5/1039 (2013.01); A6IN 5/1081 (2013.01); GO6N 20/00 (2019.01); 416H 20/40 (2018.01)(58) Field of Classification Search CPC A61N 5/103; A61N 5/1031; A61N 5/1038; A61N 5/1039; A61N 2005/1041 See application file for complete search history.Techniques for generating a radiotherapy treatment plan are provided. The techniques include receiving an input param eter related to a patient, the input parameter being of a given type; processing the input parameter with a machine learn ing technique to estimate a realizable plan parameter of a radiotherapy treatment plan, wherein the machine learning technique is trained to establish a relationship between the given type of input parameter and a set of realizable radio therapy treatment plan parameters to achieve a …
-
Unsupervised dynamic modeling of medical image transformationsNiklas Gunnarsson, Jens Sjölund, Peter Kimstrand, and 1 more authorIn 2022 25th International Conference on Information Fusion (FUSION), 2022
Spatiotemporal imaging has applications in e.g. cardiac diagnostics, surgical guidance, and radiotherapy monitoring, In this paper, we explain the temporal motion by identifying the underlying dynamics, only based on the sequential images. Our dynamical model maps the inputs of observed high-dimensional sequential images to a low-dimensional latent space wherein a linear relationship between a hidden state process and the lower-dimensional representation of the inputs holds. For this, we use a conditional variational auto-encoder (CVAE) to nonlinearly map the higher dimensional image to a lower-dimensional space, wherein we model the dynamics with a linear Gaussian state-space model (LG-SSM). The model, a modified version of the Kalman variational auto-encoder, is end-to-end trainable, and the weights, both in the CVAE and LG-SSM, are simultaneously updated by maximizing the evidence …
-
Methods for inverse planningHåkan NORDSTRÖM, Stella Riad, and Jens SJÖLUND2022
Methods for dose or treatment planning for a radiotherapy system including a radiotherapy unit are provided. A spatial dose delivered can be changed by adjusting beam shape settings, and the delivered radiation is determined using an optimization problem that steers the delivered radiation according to objectives reflecting criteria for regions of interest including at least one of: targets to be treated during treatment of the patient, organs at risk and/or healthy tissue. The method includes determining an inner set of voxels and providing a first frame description for the inner set of voxels, where the first frame description reflects criteria for the inner set of voxels. Determining an outer set of voxels encom passing the target volume and the inner set of voxels and a frame description for the outer set of voxels is provided where each reflecting criteria for the outer set of voxels. The frame descriptions are then used in the …
-
NUQ: A noise metric for diffusion MRI via uncertainty discrepancy quantificationShreyas Fadnavis, Jens Sjölund, Anders Eklund, and 1 more authorarXiv preprint arXiv:2203.01921, 2022
Diffusion MRI (dMRI) is the only non-invasive technique sensitive to tissue micro-architecture, which can, in turn, be used to reconstruct tissue microstructure and white matter pathways. The accuracy of such tasks is hampered by the low signal-to-noise ratio in dMRI. Today, the noise is characterized mainly by visual inspection of residual maps and estimated standard deviation. However, it is hard to estimate the impact of noise on downstream tasks based only on such qualitative assessments. To address this issue, we introduce a novel metric, Noise Uncertainty Quantification (NUQ), for quantitative image quality analysis in the absence of a ground truth reference image. NUQ uses a recent Bayesian formulation of dMRI models to estimate the uncertainty of microstructural measures. Specifically, NUQ uses the maximum mean discrepancy metric to compute a pooled quality score by comparing samples drawn from the posterior distribution of the microstructure measures. We show that NUQ allows a fine-grained analysis of noise, capturing details that are visually imperceptible. We perform qualitative and quantitative comparisons on real datasets, showing that NUQ generates consistent scores across different denoisers and acquisitions. Lastly, by using NUQ on a cohort of schizophrenics and controls, we quantify the substantial impact of denoising on group differences.
-
Convex inverse planning methodJens SJÖLUND and Håkan NORDSTRÖM2022
A method for treatment planning for a radiation therapy system includes setting a number of objectives reflecting clinical criteria are set for the regions of interest and generating radiation dose profiles to be delivered to these regions of interest. A convex optimization function for optimizing the delivered radiation based on the objectives is provided and dose profiles for specific treatment configurations including beam shape settings for the radiation dose profiles are calculated using the convex optimization function. Treatment plans including determining the radiation dose profiles to be delivered during treatment based on the treatment configurations are created and an optimal treatment plan that satisfies the clinical criteria is selected.
2021
-
Systems and methods for optimizing treatment planningJens Olof Sjölund2021
The present disclosure relates to systems, methods, and computer-readable storage devices for radiotherapy treat ment planning. For example, a method may generate a treatment plan for a patient. The method may receive training data reflecting radiotherapy treatment data. The training data may include a feature vector and a target vector. The method may further determine a training model based on the feature vector and the target vector. The method may further receive testing data associated with the patient. The testing data may include a descriptive feature vector. The method may further determine a therapy model based on the descriptive feature vector and the training model. The therapy model may be used to generate the treatment plan. 18 Claims, 10 Drawing Sheets
-
Motion‐compensated gradient waveforms for tensor‐valued diffusion encoding by constrained numerical optimizationFilip Szczepankiewicz, Jens Sjölund, Erica Dall’Armellina, and 4 more authorsMagnetic resonance in medicine, 2021
Diffusion‐weighted MRI is sensitive to incoherent tissue motion, which may confound the measured signal and subsequent analysis. We propose a “motion‐compensated” gradient waveform design for tensor‐valued diffusion encoding that negates the effects bulk motion and incoherent motion in the ballistic regime.Motion compensation was achieved by constraining the magnitude of gradient waveform moment vectors. The constraint was incorporated into a numerical optimization framework, along with existing constraints that account for b‐tensor shape, hardware restrictions, and concomitant field gradients. We evaluated the efficacy of encoding and motion compensation in simulations, and we demonstrated the approach by linear and planar b‐tensor encoding in a healthy heart in vivo.The optimization framework produced asymmetric motion‐compensated waveforms that yielded …
-
Cross-term-compensated gradient waveform design for tensor-valued diffusion MRIFilip Szczepankiewicz and Jens SjölundJournal of magnetic resonance, 2021
Diffusion MRI uses magnetic field gradients to sensitize the signal to the random motion of spins. In addition to the prescribed gradient waveforms, background field gradients contribute to the diffusion weighting and thereby cause an error in the measured signal and consequent parameterization. The most prominent contribution to the error comes from so-called ‘cross-terms.’ In this work we present a novel gradient waveform design that enables diffusion encoding that cancels such cross-terms and yields a more accurate measurement. This is achieved by numerical optimization that maximizes encoding efficiency with a simultaneous constraint on the ‘cross-term sensitivity’ (c = 0). We found that the optimized cross-term-compensated waveforms were superior to previous cross-term-compensated designs for a wide range of waveform types that yield linear, planar, and spherical b-tensor encoding. The efficacy of …
-
Radiotherapy treatment plans using differentiable dose functionsJens Olof Sjölund2021
Techniques for generating a radiotherapy treatment plan parameter are provided. The techniques include receiving radiotherapy treatment plan information; processing the radiotherapy treatment plan information to estimate one or more radiotherapy treatment plan parameters based on a process that depends on the output of a subprocess that estimates a derivative of a dose calculation, and generating a radiotherapy treatment plan using the estimated one or more radiotherapy treatment plan parameters.
-
Computing radiotherapy dose distributionMarkus Eriksson, Jens Olof Sjölund, Linn Öström, and 3 more authors2021
Systems and methods for calculating radiotherapy dose distribution are provided. The systems and methods include operations for receiving data representing at least one of particle trajectories or a dose deposition pattern in a simu lated delivery of a radiotherapy plan; applying a dose calculation process to the received data to generate a first radiotherapy dose distribution having a first level of detail; and processing the first radiotherapy dose distribution using a trained machine learning technique to generate a second radiotherapy dose distribution having a second level of detail that enhances the first level of detail.
-
Radiotherapy planning systemsJens Sjölund2021
Embodiments of the present disclosure are directed to treat ment planning systems for a radiotherapy apparatus. In one implementation, a treatment planning system may include a computational processor configured to apply a set of instruc tions to an input data set. The input data set may include a three-dimensional dose distribution for delivery by the radiotherapy apparatus to a volume to be irradiated, a three-dimensional volume image characterizing tissue types within the volume to be irradiated, and a set of apparatus parameters which characterize the radiotherapy apparatus.
-
Methods for inverse planningHåkan NORDSTRÖM, Björn Somell, Stella Riad, and 1 more author2021
In the field of radiotherapy, methods for dose or treatment planning for a radiotherapy system are disclosed, wherein a spatial dose delivered can be adjusted and delivered radia tion is determined using an optimization problem that steers the delivered radiation according to a frame description reflecting criteria for regions of interest that include at least one of targets to be treated during treatment of the patient, organs at risk and/or healthy tissue. The method includes estimating a voxel set receiving a higher dose than a predetermined threshold dose level, which voxel set includes voxels from at least one target volume. Further, a low dose voxel set is determined and a frame description for the voxels in the low dose voxel set is provided where voxels receiving a dose exceeding a predetermined threshold value is penalized such that the dose delivered to the low dose voxel set is suppressed. The frame description …
-
Method and system for calibrationJens SJÖLUND2021
(57) ABSTRACT A method of calibrating a positioning system in a radiation therapy system which includes a radiation therapy unit having a fixed radiation focus, includes irradiating a cali bration tool having at least one reference object, capturing at least one two-dimensional image including cross-sectional representations of reference objects of the calibration tool and determining image coordinates of the representation of each reference object. Based on the reference objects’ image coordinates, positions of the reference objects in the ste reotactic coordinate system relative to an origin of the calibration tool and the position of the origin of the calibra tion tool relative to the imaging unit, a position difference between the position of the calibration tool in the stereotac tic coordinate system and a position of the calibration tool in an imaging system coordinate system including a transla tional and rotational position …
2020
-
Modality-agnostic method for medical image representationJens Olof Sjölund and Jonas Anders Adler2020
Techniques for the operation and use of a model that learns the general representation of multimodal images is dis closed. In various examples, methods from representation learning are used to find a common basis for representation of medical images. These include aspects of encoding, fusion, and downstream tasks, with use the general representation and model. In an example, a method for generating a modality-agnostic model includes receiving imaging data, encoding the imaging data by mapping data to a latent representation, fusing the encoded data to conserve latent variables corresponding to the latent representation, and training a model using the latent representation. In an example, a method for processing imaging data using a trained modality-agnostic model includes receiving imaging data, encoding the data to the defined encoding, processing the encoded data with a trained model, and …
-
Learning a deformable registration pyramidNiklas Gunnarsson, Jens Sjölund, and Thomas B Schön2020
We introduce an end-to-end unsupervised (or weakly supervised) image registration method that blends conventional medical image registration with contemporary deep learning techniques from computer vision. Our method downsamples both the fixed and the moving images into multiple feature map levels where a displacement field is estimated at each level and then further refined throughout the network. We train and test our model on three different datasets. In comparison with the initial registrations we find an improved performance using our model, yet we expect it would improve further if the model was fine-tuned for each task. The implementation is publicly available (https://github.com/ngunnar/learning-a-deformable-registration-pyramid).
-
Convex inverse planning methodJens SJÖLUND and Håkan NORDSTRÖM2020
The present invention relates to the field of radiation therapy. In particular, the present invention relates to methods for treatment planning for radiation therapy system. The method includes setting number of objectives reflecting clinical criteria are set for the regions of interest and generating radiation dose profiles to be delivered to these regions of interest Anvex optimization function for optimizing the delivered radiation based on the objectives is provided and dose profiles for specific treatment configura ons including beam shpesettings for the radiation de profiles are calculated using the convex optimization func tion Treatment plans including determining the radiation dose profiles to be delivered during treatment based on the treatment configurations are created and an optimal treat ment plan that satisfies the clinical criteria is selected.
-
System and method for automatic treatment planningJens Olof Sjolund and Xiao Han2020
The present disclosure relates to systems, methods, and computer-readable storage media for radiotherapy. Embodi ments of the present disclosure may receive a plurality of training data and determine one or more predictive models based on the training data. The one or more predictive models may be determined based on at least one of a conditional probability density associated with a selected output characteristic given one or more selected input vari ables or a joint probability density. Embodiments of the present disclosure may also receive patient specific testing data. In addition, embodiments of the present disclosure may predict a probability density associated with a characteristic output based on the one or more predictive models and the patient specific testing data. Moreover, embodiments of the present disclosure may generate a new treatment plan based on the prediction and may use the new …
-
Registration by tracking for sequential 2D MRINiklas Gunnarsson, Jens Sjölund, and Thomas B SchönarXiv preprint arXiv:2003.10819, 2020
Our anatomy is in constant motion. With modern MR imaging it is possible to record this motion in real-time during an ongoing radiation therapy session. In this paper we present an image registration method that exploits the sequential nature of 2D MR images to estimate the corresponding displacement field. The method employs several discriminative correlation filters that independently track specific points. Together with a sparse-to-dense interpolation scheme we can then estimate of the displacement field. The discriminative correlation filters are trained online, and our method is modality agnostic. For the interpolation scheme we use a neural network with normalized convolutions that is trained using synthetic diffeomorphic displacement fields. The method is evaluated on a segmented cardiac dataset and when compared to two conventional methods we observe an improved performance. This improvement is especially pronounced when it comes to the detection of larger motions of small objects.
2019
-
Tensor-valued diffusion encoding for diffusional variance decomposition (DIVIDE): Technical feasibility in clinical MRI systemsFilip Szczepankiewicz, Jens Sjölund, Freddy Ståhlberg, and 2 more authorsPloS one, 2019
Microstructure imaging techniques based on tensor-valued diffusion encoding have gained popularity within the MRI research community. Unlike conventional diffusion encoding—applied along a single direction in each shot—tensor-valued encoding employs diffusion encoding along multiple directions within a single preparation of the signal. The benefit is that such encoding may probe tissue features that are not accessible by conventional encoding. For example, diffusional variance decomposition (DIVIDE) takes advantage of tensor-valued encoding to probe microscopic diffusion anisotropy independent of orientation coherence. The drawback is that tensor-valued encoding generally requires gradient waveforms that are more demanding on hardware; it has therefore been used primarily in MRI systems with relatively high performance. The purpose of this work was to explore tensor-valued diffusion encoding on clinical MRI systems with varying performance to test its technical feasibility within the context of DIVIDE. We performed whole-brain imaging with linear and spherical b-tensor encoding at field strengths between 1.5 and 7 T, and at maximal gradient amplitudes between 45 and 80 mT/m. Asymmetric gradient waveforms were optimized numerically to yield b-values up to 2 ms/μm2. Technical feasibility was assessed in terms of the repeatability, SNR, and quality of DIVIDE parameter maps. Variable system performance resulted in echo times between 83 to 115 ms and total acquisition times of 6 to 9 minutes when using 80 signal samples and resolution 2×2×4 mm3. As expected, the repeatability, signal-to-noise ratio and parameter …
-
A linear programming approach to inverse planning in Gamma Knife radiosurgeryJens Sjölund, Stella Riad, Marcus Hennix, and 1 more authorMedical physics, 2019
Leksell Gamma Knife® is a stereotactic radiosurgery system that allows fine‐grained control of the delivered dose distribution. We describe a new inverse planning approach that both resolves shortcomings of earlier approaches and unlocks new capabilities.We fix the isocenter positions and perform sector‐duration optimization using linear programming, and study the effect of beam‐on time penalization on the trade‐off between beam‐on time and plan quality. We also describe two techniques that reduce the problem size and thus further reduce the solution time: dualization and representative subsampling.The beam‐on time penalization reduces the beam‐on time by a factor 2–3 compared with the naïve alternative. Dualization and representative subsampling each leads to optimization time‐savings by a factor 5–20. Overall, we find in a comparison with 75 clinical plans that we …
-
A unified representation network for segmentation with missing modalitiesKenneth Lau, Jonas Adler, and Jens SjölundarXiv preprint arXiv:1908.06683, 2019
Over the last few years machine learning has demonstrated groundbreaking results in many areas of medical image analysis, including segmentation. A key assumption, however, is that the train- and test distributions match. We study a realistic scenario where this assumption is clearly violated, namely segmentation with missing input modalities. We describe two neural network approaches that can handle a variable number of input modalities. The first is modality dropout: a simple but surprisingly effective modification of the training. The second is the unified representation network: a network architecture that maps a variable number of input modalities into a unified representation that can be used for downstream tasks such as segmentation. We demonstrate that modality dropout makes a standard segmentation network reasonably robust to missing modalities, but that the same network works even better if trained on the unified representation.
2018
-
System and method for automatic treatment planningJens Olof Sjölund and Xiao Han2018
The present disclosure relates to systems, methods, and computer-readable storage media for radiotherapy. Embodi ments of the present disclosure may receive a plurality of training data and determine one or more predictive models based on the training data. The one or more predictive models may be determined based on at least one of a conditional probability density associated with a selected output characteristic given one or more selected input vari ables or a joint probability density. Embodiments of the present disclosure may also receive patient specific testing data. In addition, embodiments of the present disclosure may predict a probability density associated with a characteristic output based on the one or more predictive models and the patient specific testing data. Moreover, embodiments of the present disclosure may generate a new treatment plan based on the prediction and may use the new …
-
Bayesian uncertainty quantification in linear models for diffusion MRIJens Sjölund, Anders Eklund, Evren Özarslan, and 3 more authorsNeuroImage, 2018
Diffusion MRI (dMRI) is a valuable tool in the assessment of tissue microstructure. By fitting a model to the dMRI signal it is possible to derive various quantitative features. Several of the most popular dMRI signal models are expansions in an appropriately chosen basis, where the coefficients are determined using some variation of least-squares. However, such approaches lack any notion of uncertainty, which could be valuable in e.g. group analyses. In this work, we use a probabilistic interpretation of linear least-squares methods to recast popular dMRI models as Bayesian ones. This makes it possible to quantify the uncertainty of any derived quantity. In particular, for quantities that are affine functions of the coefficients, the posterior distribution can be expressed in closed-form. We simulated measurements from single- and double-tensor models where the correct values of several quantities are known, to validate …
-
Algorithms for magnetic resonance imaging in radiotherapyJens Sjölund2018
Radiotherapy plays an increasingly important role in cancer treatment, and medical imaging plays an increasingly important role in radiotherapy. Magnetic resonance imaging (MRI) is poised to be a major component in the development towards more effective radiotherapy treatments with fewer side effects. This thesis attempts to contribute in realizing this potential.
2017
-
Gaussian process regression can turn non-uniform and undersampled diffusion MRI data into diffusion spectrum imagingJens Sjölund, Anders Eklund, Evren Özarslan, and 1 more authorIn 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), 2017
We propose to use Gaussian process regression to accurately estimate the diffusion MRI signal at arbitrary locations in q-space. By estimating the signal on a grid, we can do synthetic diffusion spectrum imaging: reconstructing the ensemble averaged propagator (EAP) by an inverse Fourier transform. We also propose an alternative reconstruction method guaranteeing a nonnegative EAP that integrates to unity. The reconstruction is validated on data simulated from two Gaussians at various crossing angles. Moreover, we demonstrate on non-uniformly sampled in vivo data that the method is far superior to linear interpolation, and allows a drastic undersampling of the data with only a minor loss of accuracy. We envision the method as a potential replacement for standard diffusion spectrum imaging, in particular when acquisition time is limited.
2015
-
Constrained optimization of gradient waveforms for generalized diffusion encodingJens Sjölund, Filip Szczepankiewicz, Markus Nilsson, and 3 more authorsJournal of magnetic resonance, 2015
Diffusion MRI is a useful probe of tissue microstructure. The conventional diffusion encoding sequence, the single pulsed field gradient, has recently been challenged as more general gradient waveforms have been introduced. Out of these, we focus on q-space trajectory imaging, which generalizes the scalar b-value to a tensor valued entity. To take full advantage of its capabilities, it is imperative to respect the constraints imposed by the hardware, while at the same time maximizing the diffusion encoding strength. We provide a tool that achieves this by solving a constrained optimization problem that accommodates constraints on maximum gradient amplitude, slew rate, coil heating and positioning of radio frequency pulses. The method’s efficacy and flexibility is demonstrated both experimentally and by comparison with previous work on optimization of isotropic diffusion sequences.
-
Generating patient specific pseudo-CT of the head from MR using atlas-based regressionJens Sjölund, Daniel Forsberg, Mats Andersson, and 1 more authorPhysics in medicine and biology, 2015
Radiotherapy planning and attenuation correction of PET images require simulation of radiation transport. The necessary physical properties are typically derived from computed tomography (CT) images, but in some cases, including stereotactic neurosurgery and combined PET/MR imaging, only magnetic resonance (MR) images are available. With these applications in mind, we describe how a realistic, patient-specific, pseudo-CT of the head can be derived from anatomical MR images. We refer to the method as atlas-based regression, because of its similarity to atlas-based segmentation.Given a target MR and an atlas database comprising MR and CT pairs, atlas-based regression works by registering each atlas MR to the target MR, applying the resulting displacement fields to the corresponding atlas CTs and, finally, fusing the deformed atlas CTs into a single pseudo-CT.We use a deformable registration …
-
Dose-volume histogram prediction using density estimationJohanna Skarpman Munter and Jens SjölundPhysics in Medicine & Biology, 2015
Knowledge of what dose-volume histograms can be expected for a previously unseen patient could increase consistency and quality in radiotherapy treatment planning. We propose a machine learning method that uses previous treatment plans to predict such dose-volume histograms. The key to the approach is the framing of dose-volume histograms in a probabilistic setting.The training consists of estimating, from the patients in the training set, the joint probability distribution of some predictive features and the dose. The joint distribution immediately provides an estimate of the conditional probability of the dose given the values of the predictive features. The prediction consists of estimating, from the new patient, the distribution of the predictive features and marginalizing the conditional probability from the training over this. Integrating the resulting probability distribution for the dose yields an estimate of the dose …
-
MRI based radiotherapy planning and pulse sequence optimizationJens Sjölund2015
Radiotherapy plays an increasingly important role in cancer treatment, and medical imaging plays an increasingly important role in radiotherapy. Magnetic resonance imaging (MRI) is poised to be a major component in the development towards more effective radiotherapy treatments with fewer side effects. This thesis attempts to contribute in realizing this potential.
2014
-
Skull Segmentation in MRI by a Support Vector Machine Combining Local and Global FeaturesJens Sjölund, Andreas Eriksson Järliden, Mats Andersson, and 2 more authorsIn Pattern Recognition (ICPR), 2014 22nd International Conference on, 2014
Magnetic resonance (MR) images lack information about radiation transport-a fact which is problematic in applications such as radiotherapy planning and attenuation correction in combined PET/MR imaging. To remedy this, a crude but common approach is to approximate all tissue properties as equivalent to those of water. We improve upon this using an algorithm that automatically identifies bone tissue in MR. More specifically, we focus on segmenting the skull prior to stereotactic neurosurgery, where it is common that only MR images are available. In the proposed approach, a machine learning algorithm known as a support vector machine is trained on patients for which both a CT and an MR scan are available. As input, a combination of local and global information is used. The latter is needed to distinguish between bone and air as this is not possible based only on the local image intensity. A whole skull …
2012
-
Dose planning from MRI using machine learning for automatic segmentation of skull and airJens Sjölund2012
The superior soft tissue contrast and inherent patient safety of MRI makes it preferable to CT for many imaging tasks. However, the electron density information provided by CT makes it useful for dose calculations in radiotherapy. If these could instead be based solely on MRI it would spare the patient from additional ionizing radiation as well as saving the health provider the time and cost of an additional examination.In this thesis the possibility of achieving this using a machine learning algorithm called support vector machines to segment head MRI images into soft tissue, bone and air is investigated. To train the algorithm a large set of registered MRI and CT images corresponding to the same patients were used. The results were evaluated on five test patients using Monte Carlo simulations. An important finding was that the threshold value used to segment the bone in the CT images was important for the prediction performance. Moreover, the results indicate that there are significant variations in bone density among patients, an aspect with important implications for the accuracy of the dose calculations.
-
Gradient waveform design for cross-term-compensated diffusion MRI: Demonstration of tensor-valued encoding in phantom and simulationsFilip Szczepankiewicz and Jens Sjölund
Diffusion weighted imaging is perturbed by the presence of background gradients, or so-called’cross-terms,’causing bias in estimated parameters and fiber orientations. In this work, we present a novel gradient waveform design that removes the cross-term sensitivity entirely. This design is valuable for diffusion MRI methods that are otherwise corrupted by background gradients, and it also facilitates arbitrary sequence timing, b-tensor shapes and suppression of concomitant gradient effects.