2021
DOI: 10.1109/access.2021.3070575

SDTR: Soft Decision Tree Regressor for Tabular Data

Abstract: Deep neural networks have proved successful in multiple fields. However, when processing heterogeneous tabular data, researchers still favor more interpretable traditional approaches such as Bayesian methods and decision trees. Such models are hard to differentiate and therefore inconvenient to integrate into end-to-end settings. On the other hand, traditional neural networks are differentiable but perform poorly on tabular data. We propose a hierarchical differentiable neural regression model…
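The abstract describes SDTR as a differentiable neural network that mimics a binary decision tree. The sketch below shows the general soft-decision-tree idea only, not the authors' exact SDTR architecture: it assumes PyTorch, a fixed-depth binary tree whose internal nodes route inputs with sigmoid gates, and scalar leaf responses; the class name SoftTreeRegressor and all hyperparameters are illustrative.

```python
import torch
import torch.nn as nn

class SoftTreeRegressor(nn.Module):
    """Minimal soft binary decision tree of fixed depth.

    Each internal node routes an input to its children with a sigmoid
    gate, so the whole tree is differentiable; each leaf holds a scalar
    response. Generic sketch only, not the SDTR model of Luo et al. (2021).
    """

    def __init__(self, in_features: int, depth: int = 4):
        super().__init__()
        self.depth = depth
        n_internal = 2 ** depth - 1          # number of gating nodes
        n_leaves = 2 ** depth                # number of leaf responses
        self.gates = nn.Linear(in_features, n_internal)
        self.leaves = nn.Parameter(torch.zeros(n_leaves))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        batch = x.size(0)
        right_prob = torch.sigmoid(self.gates(x))   # (batch, n_internal)
        path_prob = torch.ones(batch, 1, device=x.device)
        node = 0
        for _ in range(self.depth):
            width = path_prob.size(1)
            p_right = right_prob[:, node:node + width]
            # each node splits its probability mass between its two children
            path_prob = torch.stack(
                [path_prob * (1 - p_right), path_prob * p_right], dim=2
            ).reshape(batch, 2 * width)
            node += width
        # prediction: leaf values weighted by the probability of reaching them
        return path_prob @ self.leaves
```

Because every routing decision is a sigmoid rather than a hard threshold, the whole tree can be trained end to end with gradient descent, which is the property the abstract emphasizes.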

Cited by 36 publications (17 citation statements)
References 27 publications
“…Since the input of the auxiliary models in the second and third stages of the TMSCNet was numerical data rather than images, classical machine learning regression algorithms and a fully connected network (FCN) were considered. There are many classical regression models in machine learning, such as Linear Regression (Austin and Steyerberg, 2015), Support Vector Regressor (Cortes and Vapnik, 1995), K-Nearest Neighbors Regressor (Song et al., 2017), Decision Tree Regressor (Luo et al., 2021), Random Forest Regressor (Ding and Bar-Joseph, 2017), AdaBoost Regressor (Chen et al., 2019), and Bagging Regressor (Dal Molin Ribeiro and Coelho, 2020). To select more suitable self-correcting models, we first conducted comparative experiments on the classical machine learning regression algorithms to select the best performing method, and then carried out comparative experiments on the optimized classical machine learning algorithm and the fully connected network to determine the auxiliary model for the second and third stages of the TMSCNet.…”
Section: Discussion (mentioning)
confidence: 99%
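The statement above describes selecting among classical regressors before choosing the auxiliary models. A minimal sketch of such a model-selection loop, assuming scikit-learn and synthetic placeholder data in place of the cited work's numerical inputs; the estimators' default settings are used purely for illustration.

```python
from sklearn.datasets import make_regression
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR
from sklearn.neighbors import KNeighborsRegressor
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import (AdaBoostRegressor, BaggingRegressor,
                              RandomForestRegressor)

# Synthetic stand-in for the numerical tabular features and targets.
X, y = make_regression(n_samples=300, n_features=10, noise=0.1, random_state=0)

candidates = {
    "linear": LinearRegression(),
    "svr": SVR(),
    "knn": KNeighborsRegressor(),
    "decision_tree": DecisionTreeRegressor(random_state=0),
    "random_forest": RandomForestRegressor(random_state=0),
    "adaboost": AdaBoostRegressor(random_state=0),
    "bagging": BaggingRegressor(random_state=0),
}

# 5-fold cross-validation on each candidate; lower MSE is better.
for name, model in candidates.items():
    scores = cross_val_score(model, X, y, cv=5,
                             scoring="neg_mean_squared_error")
    print(f"{name}: mean MSE = {-scores.mean():.3f}")
```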
“…This paper by [21] (Ali et al. 2021) uses a decision tree regressor with neural networks for performance optimization and reports an accuracy of 90%. Luo et al. (2021) [22] proposed the Soft Decision Tree Regressor (SDTR), a differentiable hierarchical neural regression model. SDTR is a differentiable neural network that mimics a binary decision tree, is suitable for ensemble techniques such as bagging and boosting, and achieves a result of 95.34%.…”
Section: Discussion (mentioning)
confidence: 99%
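The statement above notes that SDTR, being differentiable, fits naturally into ensemble schemes such as bagging. Below is a minimal sketch of bagging an arbitrary differentiable regressor, assuming PyTorch; the helper name bag_regressors, the bootstrap scheme, and the training hyperparameters are illustrative and not taken from the cited paper.

```python
import torch
import torch.nn as nn

def bag_regressors(make_model, X, y, n_members=5, epochs=200, lr=1e-2):
    """Train several copies of a differentiable regressor on bootstrap
    resamples and return a function averaging their predictions.
    Illustrative sketch only; make_model() is assumed to return a model
    whose output shape matches y."""
    members = []
    n = X.size(0)
    for _ in range(n_members):
        idx = torch.randint(0, n, (n,))            # bootstrap resample
        model, loss_fn = make_model(), nn.MSELoss()
        opt = torch.optim.Adam(model.parameters(), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            loss_fn(model(X[idx]), y[idx]).backward()
            opt.step()
        members.append(model)

    def predict(X_new):
        # average the member predictions, as in standard bagging
        with torch.no_grad():
            return torch.stack([m(X_new) for m in members]).mean(dim=0)

    return predict
```

Any differentiable model factory can be passed as make_model, for example a lambda returning a small soft decision tree or an MLP.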
“…CatBoost's implementation of gradient boosted decision trees has a proven track record on several different and similarly sized problems (Bentéjac et al., 2021; Luo et al., 2021). Since the proposed solution is novel, the explainability of the underlying decision trees can help in investigating performance and identifying which variables play the most significant role in location predictions.…”
Section: Methods (mentioning)
confidence: 99%
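The statement above relies on CatBoost's gradient boosted decision trees and on their explainability to identify influential variables. A minimal sketch of fitting a CatBoostRegressor and ranking features by importance, assuming the catboost package and synthetic placeholder data; the hyperparameters are illustrative, not those used in the cited studies.

```python
from catboost import CatBoostRegressor
from sklearn.datasets import make_regression

# Placeholder tabular data; replace with the real features and targets.
X, y = make_regression(n_samples=500, n_features=8, noise=0.1, random_state=0)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]

model = CatBoostRegressor(
    iterations=500,
    depth=6,
    learning_rate=0.05,
    loss_function="RMSE",
    verbose=False,
)
model.fit(X, y)

# Rank input variables by their contribution to the fitted trees.
importances = model.get_feature_importance()
for name, score in sorted(zip(feature_names, importances),
                          key=lambda pair: pair[1], reverse=True):
    print(f"{name}: {score:.2f}")
```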
“…This allows more complex models to potentially learn policy and construction year patterns. Recent work by Luo et al. (2021) addresses the favorability of traditional machine learning approaches over complex deep learning methods in the context of supervised regression with heterogeneous tabular input data. Furthermore, recent work conducting comprehensive comparisons of state-of-the-art interpretable and uninterpretable models on several datasets reveals that CatBoost's implementation of an ensemble algorithm for gradient boosting on decision trees outperforms all other models on average (Bentéjac et al., 2021).…”
Section: Related Work (mentioning)
confidence: 99%