Repository logo
 

SEQUENCE SLIDER: expanding polyalanine fragments for phasing with multiple side-chain hypotheses.

Published version
Peer-reviewed

Type

Article

Change log

Authors

Borges, Rafael Junqueira 
Meindl, Kathrin 
Triviño, Josep 
Sammito, Massimo 

Abstract

Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content.

Description

Keywords

Molecular replacement, Phasing, Arcimboldo, Phaser, Shelxe, Fragment-based Molecular Replacement, Sequence Slider, Side-chain Extension

Journal Title

Conference Name

Journal ISSN

Volume Title

Publisher

Sponsorship
Generalitat de Catalunya (2017SGR-1192)
H2020 Marie Skłodowska-Curie Actions (790122)
Ministerio de Ciencia e Innovación (MDM2014-0435-01, BIO2015-64216-P, 2017SGR-1192, BFU2017-90030-P, BIO2013-49604-EXP)
Fundação de Amparo à Pesquisa do Estado de São Paulo (2016/24191-8, 2017/13485-3)