#include <lm_state.h>

Public Member Functions
	LanguageModelNgramInfo (const char *c, int l, bool p, float nc, float ncc)

Public Attributes
STRING	context

int	context_unichar_step_len

bool	pruned

float	ngram_cost
	-ln(P_ngram_model(path))

float	ngram_and_classifier_cost
	-[ ln(P_classifier(path)) + scale_factor * ln(P_ngram_model(path)) ]

Detailed Description

Struct for storing additional information used by the Ngram language model component.

Definition at line 72 of file lm_state.h.

Constructor & Destructor Documentation

◆ LanguageModelNgramInfo()

tesseract::LanguageModelNgramInfo::LanguageModelNgramInfo	(	const char *	c,
		int	l,
		bool	p,
		float	nc,
		float	ncc
	)

inline

Definition at line 73 of file lm_state.h.

    : context(c), context_unichar_step_len(l), pruned(p), ngram_cost(nc),
      ngram_and_classifier_cost(ncc) {}
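
As a rough illustration of how the five constructor arguments map onto the members, the sketch below mirrors the initializer list in a stand-in struct. The names NgramInfoSketch and MakeExampleInfo are hypothetical, std::string is used in place of Tesseract's STRING so the example compiles without lm_state.h, and the argument values are made up for illustration.

```cpp
#include <string>

// Hypothetical stand-in for tesseract::LanguageModelNgramInfo. std::string
// replaces Tesseract's STRING (an assumption made so this compiles alone).
struct NgramInfoSketch {
  NgramInfoSketch(const char *c, int l, bool p, float nc, float ncc)
      : context(c), context_unichar_step_len(l), pruned(p), ngram_cost(nc),
        ngram_and_classifier_cost(ncc) {}

  std::string context;              // context string
  int context_unichar_step_len;     // length of context in UTF-8 steps
  bool pruned;                      // pruned by the character ngram model
  float ngram_cost;                 // -ln(P_ngram_model(path))
  float ngram_and_classifier_cost;  // combined classifier and ngram cost
};

// Illustrative values only: a two-character context that is not pruned.
inline NgramInfoSketch MakeExampleInfo() {
  return NgramInfoSketch("ab", 2, false, 1.5f, 4.0f);
}
```

The members are declared in the same order as they appear in the initializer list, so initialization happens in the order written.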

Member Data Documentation

◆ context

STRING tesseract::LanguageModelNgramInfo::context

Definition at line 76 of file lm_state.h.

◆ context_unichar_step_len

int tesseract::LanguageModelNgramInfo::context_unichar_step_len

Length of the context measured by advancing using UNICHAR::utf8_step() (should be at most the order of the character ngram model used).

Definition at line 79 of file lm_state.h.
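
To make the idea of a length measured in UTF-8 steps concrete, here is a minimal sketch that walks a context string one character at a time and counts the steps. Utf8StepSketch is a hypothetical stand-in written for this example and is not Tesseract's UNICHAR::utf8_step() implementation. It only inspects the lead byte, and CountUtf8Steps is likewise an illustrative helper.

```cpp
#include <cstddef>
#include <string>

// Hypothetical stand-in for UNICHAR::utf8_step(): returns the byte length of
// the UTF-8 character starting at s, judged from its lead byte, or 0 if the
// lead byte is not a valid start of a character.
inline int Utf8StepSketch(const char *s) {
  const unsigned char lead = static_cast<unsigned char>(*s);
  if (lead < 0x80) return 1;            // 0xxxxxxx: single-byte (ASCII)
  if ((lead & 0xE0) == 0xC0) return 2;  // 110xxxxx: two-byte sequence
  if ((lead & 0xF0) == 0xE0) return 3;  // 1110xxxx: three-byte sequence
  if ((lead & 0xF8) == 0xF0) return 4;  // 11110xxx: four-byte sequence
  return 0;                             // continuation or invalid lead byte
}

// Counts how many steps it takes to walk the whole context string.
// Returns -1 if an invalid lead byte or a truncated character is found.
inline int CountUtf8Steps(const std::string &context) {
  int steps = 0;
  std::size_t pos = 0;
  while (pos < context.size()) {
    const int step = Utf8StepSketch(context.c_str() + pos);
    if (step == 0 || pos + static_cast<std::size_t>(step) > context.size()) {
      return -1;
    }
    pos += static_cast<std::size_t>(step);
    ++steps;
  }
  return steps;
}
```

Under this sketch a context holding one ASCII letter followed by one two-byte character occupies three bytes but counts as two steps, which is the quantity compared against the order of the ngram model.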

◆ ngram_and_classifier_cost

float tesseract::LanguageModelNgramInfo::ngram_and_classifier_cost

-[ ln(P_classifier(path)) + scale_factor * ln(P_ngram_model(path)) ]

Definition at line 88 of file lm_state.h.

◆ ngram_cost

float tesseract::LanguageModelNgramInfo::ngram_cost

-ln(P_ngram_model(path))

Definition at line 86 of file lm_state.h.
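
The two cost formulas documented for ngram_cost and ngram_and_classifier_cost can be written out directly. The sketch below is only an illustration of the arithmetic: the function names and parameters are hypothetical, the probabilities are assumed to lie in (0, 1], and how Tesseract obtains scale_factor is not covered here.

```cpp
#include <cmath>

// -ln(P_ngram_model(path)), with p_ngram assumed to be in (0, 1].
inline float NgramCost(float p_ngram) {
  return -std::log(p_ngram);
}

// -[ ln(P_classifier(path)) + scale_factor * ln(P_ngram_model(path)) ],
// with both probabilities assumed to be in (0, 1].
inline float NgramAndClassifierCost(float p_classifier, float p_ngram,
                                    float scale_factor) {
  return -(std::log(p_classifier) + scale_factor * std::log(p_ngram));
}
```

With scale_factor equal to 1 the combined cost reduces to -ln(P_classifier * P_ngram), so for P_classifier = 0.5 and P_ngram = 0.25 it is -ln(0.125), roughly 2.079. Both costs are non-negative for probabilities in (0, 1] when scale_factor is non-negative, and lower values correspond to more probable paths.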

◆ pruned

bool tesseract::LanguageModelNgramInfo::pruned

Paths with pruned set have been pruned out from the perspective of the character ngram model. They are still explored further because they represent a dictionary match or a top choice, so ngram_info is still computed for them in order to calculate the combined cost.

Definition at line 84 of file lm_state.h.

The documentation for this struct was generated from the following file:

/usr/src/tesseract-ocr.master/src/wordrec/lm_state.h
