Scene text detection and recognition has attracted increasing research attention recently, especially for text of arbitrary shapes. In most of text spotting methods, text feature alignment is a key component to connect the detector and recognizer for end-to-end training. Existing alignment methods can be roughly categorized into those based on global consistent transformations and based on character-level classification. However, these methods either are unreliable for heavily deformed text or ignore contextual information in recognition. In this paper, we propose a novel textspotter named TextTriangle, which detects and recognizes text of arbitrary shapes in an end-to-end manner without character-level annotations. In TextTriangle, a text instance is described as a sequence of ordered triangles attached to each other. Based on this representation, a new PiecewiseAlign layer is designed to accurately extract features of the text instance with arbitrary shapes, which is the key to make the framework end-to-end trainable. Compared with the methods based on global consistent transformations, PiecewiseAlign adopts piecewise linear transformation for feature calculation. Experiments show that PiecewiseAlign is superior to TPS-based method in text alignment, and TextTriangle achieves competitive performance on standard scene text benchmarks.