This article evaluates RECKONING's generalizability on the real-world multi-hop logical reasoning task, FOLIO.This article evaluates RECKONING's generalizability on the real-world multi-hop logical reasoning task, FOLIO.

RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge

Abstract and 1. Introduction

  1. Background

  2. Method

  3. Experiments

    4.1 Multi-hop Reasoning Performance

    4.2 Reasoning with Distractors

    4.3 Generalization to Real-World knowledge

    4.4 Run-time Analysis

    4.5 Memorizing Knowledge

  4. Related Work

  5. Conclusion, Acknowledgements, and References

\ A. Dataset

B. In-context Reasoning with Distractors

C. Implementation Details

D. Adaptive Learning Rate

E. Experiments with Large Language Models

4.3 Generalization to Real-World knowledge

To investigate how generalizable our method is to real-world knowledge beyond the synthetic setting, we evaluate RECKONING on a more real-world multi-hop logical reasoning task, FOLIO [29], and report the result in Table 2. The dataset has a rich vocabulary, diverse logic patterns, and abundant language variations. It has been shown to challenge LLMs in both supervised fine-tuning and in-context learning settings. We fine-tune the GPT-2 model following the in-context reasoning setting as the baseline. As before, we train the GPT-2 model and RECKONING using the multi-task objective. We also compare to more advanced baselines, including GPT-3.5 (text-davinci-003 [55]) and ChatGPT(gpt-3.5-turbo[2]), two popular large language models with around 175B parameters. For these two large models, we evaluate both in the zero-shot and few-shot settings. In the few-shot setting, we prompt the model with 8 single-task examples randomly sampled from the training set to perform in-context learning. We find that RECKONING’s performance (which is initiated here from GPT-2) is better than the GPT-2 in-context reasoning baseline. Compared to the two advanced large language models, RECKONING outperforms them by a significant margin (12% 0-shot and 7% 8-shot). We conclude that RECKONING is effective and significantly benefits reasoning tasks using real-world knowledge.

\ Table 2: Evaluation results on FOLIO. We compare RECKONING against the FT-ICR baseline with GPT-2 and two popular large language models.

\

:::info Authors:

(1) Zeming Chen, EPFL ([email protected]);

(2) Gail Weiss, EPFL ([email protected]);

(3) Eric Mitchell, Stanford University ([email protected])';

(4) Asli Celikyilmaz, Meta AI Research ([email protected]);

(5) Antoine Bosselut, EPFL ([email protected]).

:::


:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

2 https://openai.com/blog/chatgpt

Market Opportunity
RealLink Logo
RealLink Price(REAL)
$0.07302
$0.07302$0.07302
-0.54%
USD
RealLink (REAL) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Viewbots.com Redefines the “Viewbot” with the Launch of the Industry’s First AI-Powered Growth Engine

Viewbots.com Redefines the “Viewbot” with the Launch of the Industry’s First AI-Powered Growth Engine

Viewbots.com Redefines the “Viewbot” with the Launch of the Industry’s First AI-Powered Growth Engine Moving beyond simple metric inflation, the new platform utilizes
Share
Techbullion2026/01/25 20:49
Five Market Events Next Week Could Decide Bitcoin’s Next Big Move

Five Market Events Next Week Could Decide Bitcoin’s Next Big Move

Five US events next week GDP, $8.3B liquidity ops, Fed rate decision, balance sheet update and FOMC speech may steer Bitcoin soon. Financial markets are preparing
Share
LiveBitcoinNews2026/01/25 21:00
HOT MOMENTS: FOMC Statement Released Following the Fed Interest Rate Decision – Here Are All the Details of the Full Text

HOT MOMENTS: FOMC Statement Released Following the Fed Interest Rate Decision – Here Are All the Details of the Full Text

The post HOT MOMENTS: FOMC Statement Released Following the Fed Interest Rate Decision – Here Are All the Details of the Full Text appeared on BitcoinEthereumNews.com. The Fed has resumed interest rate cuts after a nine-month hiatus, lowering the federal funds rate by 25 basis points to a range of 4% to 4.25%. According to the “dot plot” projection reflected in the decision text, two additional interest rate cuts are envisaged in 2025. While 9 out of 19 officials expected two more interest rate cuts this year, 2 predicted a single cut, and 6 predicted no additional cuts. Newly appointed Fed Board member Stephen I. Miran dissented from the decision, voting for a stronger 50 basis point cut. The decision noted that economic growth slowed in the first half of the year, employment growth slowed, and the unemployment rate rose slightly. It also noted that inflation had begun to rise but remained high. While reiterating that it maintains its long-term targets of maximum employment and 2% inflation, the Fed noted that uncertainties regarding the economic outlook remain high. The statement read, “The Committee assesses that downside risks to employment have increased, in line with the balance of risks.” The statement stated that interest rate policy will be reshaped in the coming period, taking into account future data, the economic outlook, and the balance of risks. It also noted that the reduction in holdings of Treasury bonds, corporate debt instruments, and mortgage-backed securities will continue. The resolution was supported by Fed Chair Jerome Powell, Vice Chair John C. Williams, and board members Michael S. Barr, Michelle W. Bowman, Susan M. Collins, Lisa D. Cook, Austan D. Goolsbee, Philip N. Jefferson, Alberto G. Musalem, Jeffrey R. Schmid, and Christopher J. Waller. *This is not investment advice. Follow our Telegram and Twitter account now for exclusive news, analytics and on-chain data! Source: https://en.bitcoinsistemi.com/hot-moments-fomc-statement-released-following-the-fed-interest-rate-decision-here-are-all-the-details-of-the-full-text/
Share
BitcoinEthereumNews2025/09/18 14:18