HTTP/2 301
date: Mon, 19 Jan 2026 04:15:24 GMT
content-length: 0
location: https://doi.org/10.1101/433763
server: cloudflare
vary: Origin
expires: Tue, 20 Jan 2026 04:15:24 GMT
permissions-policy: interest-cohort=(),browsing-topics=()
cf-cache-status: DYNAMIC
nel: {"report_to":"cf-nel","success_fraction":0.0,"max_age":604800}
strict-transport-security: max-age=31536000; includeSubDomains; preload
report-to: {"group":"cf-nel","max_age":604800,"endpoints":[{"url":"https://a.nel.cloudflare.com/report/v4?s=gSsIwFSPiYRrEZI%2FhZjxLbMj%2BWgmmbWVHgXWn8rwMKCrsHwF4DkR6%2BxybRoHrX4TlLqwelwOZUO6yJSYusdUdFeBC8LRDw%3D%3D"}]}
cf-ray: 9c0382c06a3b75e9-BLR
alt-svc: h3=":443"; ma=86400
HTTP/2 302
date: Mon, 19 Jan 2026 04:15:24 GMT
content-type: text/html;charset=utf-8
location: https://biorxiv.org/lookup/doi/10.1101/433763
server: cloudflare
vary: Origin
vary: Accept
expires: Mon, 19 Jan 2026 05:11:01 GMT
permissions-policy: interest-cohort=(),browsing-topics=()
cf-cache-status: DYNAMIC
nel: {"report_to":"cf-nel","success_fraction":0.0,"max_age":604800}
strict-transport-security: max-age=31536000; includeSubDomains; preload
report-to: {"group":"cf-nel","max_age":604800,"endpoints":[{"url":"https://a.nel.cloudflare.com/report/v4?s=doys2BeB4eCeNAFl5PV2hyQ50Fdrxs0aZXRP7b7shtIXvxZP1wy71yDn1Ut1ptq0t9GBBMO%2B2e7kOnBGf3FPs9%2FVkafexg%3D%3D"}]}
cf-ray: 9c0382c1fc7075e9-BLR
alt-svc: h3=":443"; ma=86400
HTTP/1.1 302 Found
Date: Mon, 19 Jan 2026 04:15:25 GMT
Content-Type: text/html; charset=iso-8859-1
Transfer-Encoding: chunked
Connection: keep-alive
server: cloudflare
location: https://www.biorxiv.org/lookup/doi/10.1101/433763
cf-cache-status: DYNAMIC
Nel: {"report_to":"cf-nel","success_fraction":0.0,"max_age":604800}
Report-To: {"group":"cf-nel","max_age":604800,"endpoints":[{"url":"https://a.nel.cloudflare.com/report/v4?s=Lx7qpIDO3bTUKF0NfJCcc2C5KYVMuLSnlY7FnKnVYrsuRNx1k6BbdFKJ2U8%2BTA8IIIplOU3x6LDDgENJuibxEcT3S%2FWZ%2BLytiMeH"}]}
CF-RAY: 9c0382c299f723ff-BOM
alt-svc: h3=":443"; ma=86400
HTTP/2 301
date: Mon, 19 Jan 2026 04:15:25 GMT
content-type: text/html; charset=UTF-8
location: https://www.biorxiv.org/content/10.1101/433763v5
cf-ray: 9c0382c5cb19e9c3-BLR
x-content-type-options: nosniff
x-content-type-options: nosniff
x-drupal-cache: MISS
expires: Mon, 19 Jan 2026 04:45:25 GMT
cache-control: public, max-age=1800
x-varnish-ttl:
pragma: no-cache
vary: Accept-Encoding
x-highwire-sitecode: biorxiv
x-highwire-smart-code: biorxiv_production
x-varnish: 1897536862
x-varnish-cache:
via: 1.1 varnish
cf-cache-status: MISS
set-cookie: __cf_bm=sM6i6zkgRkQpYww0ZKw3Ht1U1a45ThcJ8IqxoTd5Tb8-1768796125-1.0.1.1-CCGtbS6UnnCfsZ8AD0yknfBsr4owlEBjtloOTMBOozrs8tzjG0PW2xOASQ1GkUyGOl9zoATDDh0CCU0VMWOELGj140dUS8_dxGRnTrctwPE; path=/; expires=Mon, 19-Jan-26 04:45:25 GMT; domain=.www.biorxiv.org; HttpOnly; Secure; SameSite=None
server: cloudflare
HTTP/2 200
date: Mon, 19 Jan 2026 04:15:28 GMT
content-type: text/html; charset=utf-8
content-encoding: gzip
x-content-type-options: nosniff
x-content-type-options: nosniff
x-drupal-cache: MISS
expires: Sun, 19 Nov 1978 05:00:00 GMT
cache-control: no-cache, must-revalidate
set-cookie: SSESS1dd6867f1a1b90340f573dcdef3076bc=TJ0vWGNJdCAP9ZcKdeNOheOLP_b5ugbY0_rssd2N0Eo; expires=Wed, 11-Feb-2026 07:48:46 GMT; path=/; domain=.biorxiv.org; secure; HttpOnly
content-language: en
x-frame-options: SAMEORIGIN
x-generator: Drupal 7 (https://drupal.org)
link:
; rel="canonical",; rel="shortlink"
vary: Accept-Encoding
x-highwire-sitecode: biorxiv
x-highwire-smart-code: biorxiv_production
x-varnish: 699661072
age: 0
via: 1.1 varnish
x-varnish-ttl:
x-varnish-cache:
cf-cache-status: DYNAMIC
server: cloudflare
cf-ray: 9c0382c91ff2e9c3-BLR
Unsupervised deep learning with variational autoencoders applied to breast tumor genome-wide DNA methylation data with biologic feature extraction | bioRxiv
New Results
Unsupervised deep learning with variational autoencoders applied to breast tumor genome-wide DNA methylation data with biologic feature extraction
View ORCID ProfileAlexander J. Titus, Owen M. Wilkins, Carly A. Bobak, Brock C. Christensen
doi: https://doi.org/10.1101/433763

Abstract
Recent advances in deep learning, particularly unsupervised approaches, have shown promise for furthering our biological knowledge through their application to gene expression datasets, though applications to epigenomic data are lacking. Here, we employ an unsupervised deep learning framework with variational autoencoders (VAEs) to learn latent representations of the DNA methylation landscape from three independent breast tumor datasets. Through interrogation of methylation-based learned latent dimension activation values, we demonstrate the feasibility of VAEs to track representative differential methylation patterns among clinical subtypes of tumors. CpGs whose methylation was most correlated VAE latent dimension activation values were significantly enriched for CpG sparse regulatory regions of the genome including enhancer regions. In addition, through comparison with LASSO, we show the utility of the VAE approach for revealing novel information about CpG DNA methylation patterns in breast cancer.
Copyright
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
It is made available under a CC-BY-NC-ND 4.0 International license.