Draft:Financial Labelling

Submission declined on 14 December 2025 by Timtrent (talk).

This submission is not adequately supported by reliable sources. Reliable sources are required so that information can be verified. If you need help with referencing, please see Referencing for beginners and Citing sources.

If you would like to continue working on the submission, click on the "Edit" tab at the top of the window.
If you have not resolved the issues listed above, your draft will be declined again and potentially deleted.
If you need extra help, please ask us a question at the AfC Help Desk or get live help from experienced editors.
Please do not remove reviewer comments or this notice until the submission is accepted.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Declined by Timtrent 41 hours ago. Last edited by Timtrent 41 hours ago. Reviewer: Inform author.

Resubmit

Please note that if the issues are not fixed, the draft will be declined again.

Submission declined on 14 November 2025 by Aesurias (talk).

Your draft shows signs of having been generated by a large language model, such as ChatGPT. Wikipedia guidelines prohibit the use of LLMs to write articles from scratch. In addition, LLM-generated articles usually have multiple quality issues, to include:

Promotional tone, editorializing and other words to watch
Vague, generic, and speculative statements extrapolated from similar subjects
Essay-like writing
Hallucinations (plausible-sounding, but false information) and non-existent references
Close paraphrasing

Please address these issues. The best way is usually to read reliable sources and summarize them, instead of using a large language model. See our help page on large language models.

This draft's references do not show that the subject qualifies for a Wikipedia article. In summary, the draft needs multiple published sources that are:

in-depth (not just passing mentions about the subject)
reliable
secondary
independent of the subject

Make sure you add references that meet these criteria before resubmitting. Learn about mistakes to avoid when addressing this issue. If no additional references exist, the subject is not suitable for Wikipedia.

Declined by Aesurias 32 days ago.

Comment: Great swathes of text have no citations
We require references from significant coverage about the topic of the article, and independent of it, in multiple secondary sources which are WP:RS please. See WP:42. Please also see WP:PRIMARY which details the limited permitted usage of primary sources and WP:SELFPUB which has clear limitations on self published sources. Providing sufficient references, ideally one per fact referred to, that meet these tough criteria is likely to allow this article to remain. Lack of them or an inability to find them is likely to mean that the topic is not suitable for inclusion, certainly today. 🇵🇸‍🇺🇦 Fiddle^Timtrent Faddle^Talk to me 🇺🇦‍🇵🇸 23:22, 14 December 2025 (UTC)

Financial labelling is the process of assigning outcome variables to financial time series for use in machine learning, quantitative finance and backtesting. It converts raw market data such as prices, returns or volumes into labelled observations that can be used for supervised learning, risk analysis or model evaluation. Labelling methods aim to characterize the direction, magnitude and timing of market movements while limiting sources of methodological bias such as look-ahead bias.^[1]^[2]

In addition to bias control, effective labelling must address several statistical challenges inherent in financial data. One major issue is the non-stationarity of financial time series: distributions of returns, volatilities and correlations often shift due to evolving market regimes, changes in liquidity and structural breaks in economic conditions. Because the underlying data-generating process is not stable over time, labels derived from past data may not generalize out-of-sample unless the labelling method adapts to changing volatility or event frequency.^[3]

Another central challenge is the extremely low signal-to-noise ratio characteristic of most financial prediction tasks. Price dynamics are dominated by noise, and economically meaningful movements tend to be weak and infrequent. In such an environment, even small imperfections in the input data or feature engineering can have outsized effects on model performance. This reflects the classic garbage in, garbage out problem and makes robust data preprocessing and validation essential.^[4] Labels that rely on fixed horizons or fixed thresholds can inadvertently encode noise rather than signal, leading to model overfitting and poor robustness. Event-based labelling methods, volatility-scaled barriers and trend-scanning approaches attempt to reduce this problem by defining outcomes relative to market conditions instead of arbitrary time intervals.^[5]

Financial time series also exhibit heteroskedasticity, autocorrelation and regime shifts, all of which influence the statistical properties of the generated labels. Techniques such as structural break detection, volatility modelling and event-based sampling help maintain the integrity of the labelling process by better aligning labels with the true economic events occurring in the market. Ensuring that the labelled dataset reflects tradable opportunities rather than artifacts of sampling or microstructure noise is essential for building reliable machine-learning models.

Overview

Financial labelling is a central step in the construction of machine learning systems for forecasting or classification in finance. Because financial data are sequential and exhibit temporal dependence, labels must reflect not only the sign of outcomes but also the time over which those outcomes occur. A label typically represents whether a price change or trading rule would have produced a gain, a loss or no meaningful movement over a defined period or under a specific rule.

Most labelling frameworks involve three design elements:

Selecting the events or observations to be labelled.
Defining labelling methods for how the outcome of each event is measured.
Choosing validation procedures that avoid information leakage from the future.

Recent papers highlight the importance of robust labelling due to the instability and noise present in financial time series, and the tendency of many models to overfit when labels are poorly constructed.^[6]^[7]

Event generation

Before outcomes can be assigned, researchers determine which points in time are to be labelled. Common approaches include:

Time bars: Observations sampled at fixed time intervals.
Tick bars: Bars formed after a fixed number of transactions.
Volume bars: Observations created once cumulative traded volume reaches a threshold.
Dollar bars: Bars formed when cumulative traded dollar value exceeds a specified amount.
Information bars: The purpose of information-driven bars is to sample more frequently when new information arrives to the market. In this context, the word “information” is used in a market microstructural sense. Market microstructure theories confer special importance to the persistence of imbalanced signed volumes, as that phenomenon is associated with the presence of informed traders. By synchronizing sampling with the arrival of informed traders, we may be able to make decisions before prices reach a new equilibrium level.

For sampling, event-based methods such as CUSUM filters that detect statistically significant price changes may be used. Event-based bars are increasingly studied due to their improved statistical properties compared with time bars, while dollar bars are seen to have more stable characteristics over longer timeframes as market volumes and trade frequency evolve with more participants and efficient market-makers, but dollar turnover remains relatively stable. ^[1]

Labelling methods

A variety of labelling approaches exist depending on the research goal:

Fixed time horizon labelling

As it relates to finance, virtually all ML papers label observations using the fixed-time-horizon method. This method assigns each event a label based on the return over the next h bars. Using a forward return r = (pti,0+h / pti,0) − 1, a threshold τ partitions outcomes into positive, negative or neutral classes.

This method is simple but has notable limitations.^[1] Volatility varies across time bars, making a constant threshold τ behave inconsistently. Labels also ignore the price path: a position may hit a stop-loss or margin limit long before the horizon ends, causing unrealistic labels.

These issues motivate the use of path-dependent alternatives such as the triple-barrier method and trend-scanning labels.

Triple-barrier method

The triple-barrier method was introduced by Marcos Lopez De Prado as an alternative, path-dependent labelling framework intended to better reflect how trades evolve in practice. For each event, three barriers are defined: two price-based (upper and lower) and one time-based.

The upper and lower barriers act as profit-taking and stop-loss limits. Their width is determined dynamically using an estimate of volatility: either realized or implied, scaled as desired. These barriers do not need to be symmetric.
The vertical barrier is a maximum holding-period constraint measured as a fixed number of bars from the event’s start time. If this time limit is reached before either price barrier, the label is determined either by the sign of the return at expiration or set to 0, depending on the practitioner’s preference.

The assigned label reflects the first barrier touched:

Upper barrier touched → label = 1
Lower barrier touched → label = −1
Vertical barrier touched → label = sign of return (or 0 if using the neutral convention)

Because the label depends on the path of prices, the method requires evaluating movements over the interval , where h is the vertical barrier.

The triple-barrier method improves upon fixed-horizon labeling by incorporating information about the full price path rather than relying solely on end-of-period returns. Because the label is determined by the first barrier breached, the framework closely approximates practical trade management rules such as stop-loss, take-profit, and maximum holding-time constraints. Barrier levels are typically scaled by contemporaneous volatility, which helps reduce look-ahead bias and ensures that thresholds reflect market conditions observable at the event’s initiation.

The method also mitigates class imbalance common in financial classification tasks. Dynamic upper and lower price barriers, combined with a time-based expiration, tend to generate a more even distribution of positive, negative, and neutral outcomes compared with single-horizon return labels. Because barrier widths expand and contract with volatility, the resulting labels naturally adapt across different market regimes.

In machine-learning applications, the triple-barrier framework is widely used for constructing targets in meta-labeling, signal validation, and risk-aware forecasting. Its path-dependent labels capture features of real trade execution, enabling models trained on them to align more closely with practical trading objectives and systematic strategy design.

Trend-scanning labelling

Trend-scanning labels identify the forward window that maximizes a statistical measure of trend strength. Because the optimal horizon changes with market conditions, labels adapt to local directional structure rather than imposing a fixed horizon.

Meta-labelling

Meta-labelling in financial time-series is used when a primary model determines trade direction. A secondary model is trained on labels indicating whether the trade would have been profitable. The secondary model filters signals, aiming to distinguish true positives from false positives and improve precision and recall.

It reduces overfitting, allows the integration of domain-specific primary models, and enables asymmetric long/short logic. It separates signal generation from position sizing, improving risk management.^[6]^[8]

Validation considerations

Because labels depend on future price behavior, improper validation can lead to information leakage. In financial datasets, observations often overlap in time, and label horizons extend forward, creating dependencies that standard cross-validation cannot handle. Several techniques are designed specifically to address serial dependence, overlapping events, and temporal structure:

Purged k-fold cross-validation: This method removes (“purges”) from the training set any samples whose label-generation window overlaps with the test fold. Since financial labels often use forward-looking horizons (e.g., barrier touches, event windows), training on points that occur before the test fold but whose labeling window extends into the test period would leak future information. Purged k-fold ensures that only truly non-overlapping training samples are used, preserving the temporal integrity of the validation procedure.

Embargo periods: Even after purging overlaps, samples that occur immediately after a test fold may still indirectly contain information about the test window due to market reactions or structural continuity in the data. An embargo period enforces a buffer of time after each test fold during which samples are excluded from training. The embargo is typically set to the maximum look-ahead horizon used in the labeling process, preventing subtle short-range leakage.

Combinatorial cross-validation: Financial time series often exhibit regime shifts and non-stationarity, meaning that model performance can vary significantly depending on which periods are used for training and testing. Combinatorial cross-validation constructs multiple non-overlapping combinations of folds, evaluating the model across many temporal arrangements rather than relying on a single split pattern. This helps reduce variance in performance estimates, improves robustness, and reveals how sensitive a model is to specific market regimes.

These approaches address the serial dependence, overlapping observations, and temporal dynamics common in financial datasets, providing more reliable and unbiased model validation. ^[1]

Challenges

Financial labelling presents several difficulties:

Class imbalance: Neutral or negative labels may dominate.
Overlapping events: Creates highly dependent training samples.
Barrier calibration: Volatility estimates vary with market regime.
Sampling bias: Time bars may underrepresent high-information periods.
Path dependence: Fixed-horizon labels ignore early stop-outs.

Recent work emphasizes careful feature engineering and thorough label construction to avoid structural modelling errors.^[8]

Applications

Financial labelling is used in:

Supervised learning models for price direction or profitability
Meta-models that filter directional forecasts
Backtesting strategies with explicit entry and exit rules
Risk modelling frameworks with dynamic thresholds

References

^ ^a ^b ^c ^d López de Prado, Marcos (2018). Advances in Financial Machine Learning. Wiley.
^ "What is Data Labeling? - Data Labeling Explained - AWS". Amazon Web Services, Inc. Retrieved 16 July 2024.
^ Hamilton, James D. (1994). Time Series Analysis. Princeton University Press.
^ Geiger, R. Stuart; Cope, Dominique; Ip, Jamie; Lotosh, Marsha; Shah, Aayush; Weng, Jenny; Tang, Rebekah (5 November 2021). ""Garbage in, garbage out" revisited: What do machine learning application papers report about human-labeled training data?". Quantitative Science Studies. 2 (3): 795–827. arXiv:2107.02278. doi:10.1162/qss_a_00144. ISSN 2641-3337.
^ Bouchaud, Jean-Philippe; Potters, Marc (2003). Theory of Financial Risk and Derivative Pricing. Cambridge University Press.
^ ^a ^b López de Prado, Marcos (2019). "Ten Applications of Financial Machine Learning". SSRN. SSRN 3365271.
^ Dixon, Matthew; Halperin, Igor (2019). "The Four Horsemen of Machine Learning in Finance". SSRN. doi:10.2139/ssrn.3453564. SSRN 3453564.
^ ^a ^b "Machine Learning from a "Universe" of Signals: The Role of Feature Engineering". SSRN. 2025. SSRN 5248179.

[AFML-1] López de Prado, Marcos (2018). Advances in Financial Machine Learning. Wiley.

[2] "What is Data Labeling? - Data Labeling Explained - AWS". Amazon Web Services, Inc. Retrieved 16 July 2024.

[Hamilton1994-3] Hamilton, James D. (1994). Time Series Analysis. Princeton University Press.

[4] Geiger, R. Stuart; Cope, Dominique; Ip, Jamie; Lotosh, Marsha; Shah, Aayush; Weng, Jenny; Tang, Rebekah (5 November 2021). ""Garbage in, garbage out" revisited: What do machine learning application papers report about human-labeled training data?". Quantitative Science Studies. 2 (3): 795–827. arXiv:2107.02278. doi:10.1162/qss_a_00144. ISSN 2641-3337.

[BouchaudPotters-5] Bouchaud, Jean-Philippe; Potters, Marc (2003). Theory of Financial Risk and Derivative Pricing. Cambridge University Press.

[TenApps-6] López de Prado, Marcos (2019). "Ten Applications of Financial Machine Learning". SSRN. SSRN 3365271.

[4Horsemen-7] Dixon, Matthew; Halperin, Igor (2019). "The Four Horsemen of Machine Learning in Finance". SSRN. doi:10.2139/ssrn.3453564. SSRN 3453564.

[Signals2025-8] "Machine Learning from a "Universe" of Signals: The Role of Feature Engineering". SSRN. 2025. SSRN 5248179.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]