A B C D E F G H I K L M N O P R S T U V W

A

a - Variable in class morfologik.fsa.FSAUtils.IntIntHolder
 
accept(int) - Method in interface morfologik.fsa.StateVisitor
 
add(byte[], int, int) - Method in class morfologik.fsa.FSABuilder
Add a single sequence of bytes to the FSA.
ADDRESS_OFFSET - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Offset of the address field inside an arc.
ADDRESS_OFFSET - Static variable in class morfologik.fsa.FSA5
An offset in the arc structure, where the address and flags field begins.
annotation - Variable in class morfologik.fsa.FSA5
Annotation character.
annotationByte - Variable in class morfologik.fsa.FSA5Serializer
 
ARC_SIZE - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Size of a single arc structure.
arcs - Variable in class morfologik.fsa.CFSA
An array of bytes with the internal representation of the automaton.
arcs - Variable in class morfologik.fsa.CFSA2
An array of bytes with the internal representation of the automaton.
arcs - Variable in class morfologik.fsa.FSA5
An array of bytes with the internal representation of the automaton.
arcsCount - Variable in class morfologik.fsa.FSAInfo
Number of arcs in the automaton, excluding an arc from the zero node (initial) and an arc from the start node to the root node.
arcsCountTotal - Variable in class morfologik.fsa.FSAInfo
Total number of arcs, counting arcs that physically overlap due to merging.
Arrays - Class in morfologik.util
Compatibility layer for JVM 1.5.
assertExists(File, boolean, boolean) - Static method in class morfologik.util.FileUtils
Checks if the given file exists.
asShort(Set<FSAFlags>) - Static method in enum morfologik.fsa.FSAFlags
Returns the set of flags encoded in a single short.
asString(byte[], String) - Static method in class morfologik.tools.MorphEncoder
Converts a byte array to a given encoding.
ATTR_NAME_ENCODING - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.encoding.
ATTR_NAME_SEPARATOR - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.separator.
ATTR_NAME_USES_INFIXES - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.usesInfixes.
ATTR_NAME_USES_PREFIXES - Static variable in class morfologik.stemming.DictionaryMetadata
Attribute name for DictionaryMetadata.usesPrefixes.
AUTOMATON_HAS_PREFIX - Static variable in class morfologik.fsa.MatchResult
The automaton contains a prefix of the input sequence.

B

b - Variable in class morfologik.fsa.FSAUtils.IntIntHolder
 
BIT_ARC_FINAL - Static variable in class morfologik.fsa.ConstantArcSizeFSA
An arc flag indicating the target node of an arc corresponds to a final state.
BIT_ARC_LAST - Static variable in class morfologik.fsa.ConstantArcSizeFSA
An arc flag indicating the arc is last within its state.
BIT_FINAL_ARC - Static variable in class morfologik.fsa.CFSA
Bitmask indicating that an arc corresponds to the last character of a sequence available when building the automaton.
BIT_FINAL_ARC - Static variable in class morfologik.fsa.CFSA2
The arc corresponds to the last character of a sequence available when building the automaton (acceptor transition).
BIT_FINAL_ARC - Static variable in class morfologik.fsa.FSA5
Bit indicating that an arc corresponds to the last character of a sequence available when building the automaton.
BIT_LAST_ARC - Static variable in class morfologik.fsa.CFSA
Bitmask indicating that an arc is the last one of the node's list and the following one belongs to another node.
BIT_LAST_ARC - Static variable in class morfologik.fsa.CFSA2
The arc is the last one from the current node's arcs list.
BIT_LAST_ARC - Static variable in class morfologik.fsa.FSA5
Bit indicating that an arc is the last one of the node's list and the following one belongs to another node.
BIT_TARGET_NEXT - Static variable in class morfologik.fsa.CFSA
Bitmask indicating that the target node of this arc follows it in the compressed automaton structure (no goto field).
BIT_TARGET_NEXT - Static variable in class morfologik.fsa.CFSA2
The target node of this arc follows the last arc of the current state (no goto field).
BIT_TARGET_NEXT - Static variable in class morfologik.fsa.FSA5
Bit indicating that the target node of this arc follows it in the compressed automaton structure (no goto field).
bits - Variable in enum morfologik.fsa.FSAFlags
Bit mask for the corresponding flag.
BufferUtils - Class in morfologik.util
Utility functions for buffers.
build(byte[][]) - Static method in class morfologik.fsa.FSABuilder
Build a minimal, deterministic automaton from a sorted list of byte sequences.
build(Iterable<byte[]>) - Static method in class morfologik.fsa.FSABuilder
Build a minimal, deterministic automaton from an iterable list of byte sequences.

C

calculateFanOuts(FSA, int) - Static method in class morfologik.fsa.FSAUtils
Calculate fan-out ratio.
CFSA - Class in morfologik.fsa
CFSA (Compact Finite State Automaton) binary format implementation.
CFSA(InputStream) - Constructor for class morfologik.fsa.CFSA
Creates a new automaton, reading it from a file in FSA format, version 5.
CFSA2 - Class in morfologik.fsa
CFSA (Compact Finite State Automaton) binary format implementation, version 2: CFSA2.BIT_TARGET_NEXT applicable on all arcs, not necessarily the last one.
CFSA2(InputStream) - Constructor for class morfologik.fsa.CFSA2
Reads an automaton from a byte stream.
CFSA2Serializer - Class in morfologik.fsa
Serializes in-memory FSA graphs to CFSA2.
CFSA2Serializer() - Constructor for class morfologik.fsa.CFSA2Serializer
 
checkSingleByte(String) - Static method in class morfologik.tools.FSABuildTool
Check if the argument is a single byte after conversion using platform-default encoding.
clone() - Method in class morfologik.stemming.WordData
Declares a covariant override of Object.clone() that returns a deep copy of this object.
close(Closeable...) - Static method in class morfologik.util.FileUtils
Force-close any non-null closeables.
commonPrefix(byte[], byte[]) - Static method in class morfologik.tools.MorphEncoder
 
compare(byte[], int, int, byte[], int, int) - Static method in class morfologik.fsa.FSABuilder
Lexicographic order of input sequences.
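As an illustration of the ordering named in this entry, here is a minimal sketch of a lexicographic comparison over two byte ranges. It assumes bytes are compared as unsigned values and that a range sorts before any longer range it is a prefix of. The class name LexicographicCompareSketch is hypothetical, and this is not the library's code.

```java
/** Hypothetical illustration; not part of morfologik. */
final class LexicographicCompareSketch {
    /**
     * Compares s1[start1, start1 + len1) with s2[start2, start2 + len2).
     * Assumption: bytes are treated as unsigned values (0..255).
     */
    static int compare(byte[] s1, int start1, int len1,
                       byte[] s2, int start2, int len2) {
        final int max = Math.min(len1, len2);
        for (int i = 0; i < max; i++) {
            final int c1 = s1[start1 + i] & 0xff;
            final int c2 = s2[start2 + i] & 0xff;
            if (c1 != c2) {
                return c1 - c2;
            }
        }
        // Equal on the shared prefix: the shorter range sorts first.
        return len1 - len2;
    }
}
```

Masking with 0xff matters because Java bytes are signed: without it, 0x80 would sort before 0x7f.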
complete() - Method in class morfologik.fsa.FSABuilder
Complete the automaton.
ConstantArcSizeFSA - Class in morfologik.fsa
An FSA with constant-size arc representation produced directly by FSABuilder.

D

decodeStem(ByteBuffer, byte[], int, ByteBuffer, DictionaryMetadata) - Static method in class morfologik.stemming.DictionaryLookup
Decode the base form of an inflected word and save its decoded form into a byte buffer.
DEFAULT_ANNOTATION - Static variable in class morfologik.fsa.FSA5
Default annotation byte.
DEFAULT_FILLER - Static variable in class morfologik.fsa.FSA5
Default filler byte.
defaultDictionaries - Static variable in class morfologik.stemming.Dictionary
Default loaded dictionaries.
Dictionary - Class in morfologik.stemming
A dictionary combines FSA automaton and metadata describing the internals of dictionary entries' coding (DictionaryMetadata).
Dictionary(FSA, DictionaryMetadata) - Constructor for class morfologik.stemming.Dictionary
It is strongly recommended to use static methods in this class for reading dictionaries.
DictionaryIterator - Class in morfologik.stemming
An iterator over WordData entries of a Dictionary.
DictionaryIterator(Dictionary, CharsetDecoder, boolean) - Constructor for class morfologik.stemming.DictionaryIterator
 
DictionaryLookup - Class in morfologik.stemming
This class implements a dictionary lookup over an FSA dictionary.
DictionaryLookup(Dictionary) - Constructor for class morfologik.stemming.DictionaryLookup
Creates a new object of this class using the given FSA for word lookups and encoding for converting characters to bytes.
DictionaryMetadata - Class in morfologik.stemming
Description of attributes, their types and default values.
DictionaryMetadata(char, String, boolean, boolean, Map<String, String>) - Constructor for class morfologik.stemming.DictionaryMetadata
Creates an immutable instance of DictionaryMetadata.
dumpLine(byte[], int) - Method in class morfologik.tools.FSABuildTool
Dump input line, byte-by-byte.

E

encoding - Variable in class morfologik.stemming.DictionaryMetadata
Encoding used for converting bytes to characters and vice versa.
endPart() - Method in interface morfologik.tools.IMessageLogger
 
endPart() - Method in class morfologik.tools.WriterMessageLogger
 
ensureCapacity(ByteBuffer, int) - Static method in class morfologik.util.BufferUtils
Ensure the byte buffer's capacity.
ensureCapacity(CharBuffer, int) - Static method in class morfologik.util.BufferUtils
Ensure the char buffer's capacity.
equals(Object) - Method in class morfologik.stemming.WordData
 
equals(byte[], int, byte[], int, int) - Static method in class morfologik.util.Arrays
Compare two arrays for equality.
equals(boolean[], int, boolean[], int, int) - Static method in class morfologik.util.Arrays
Compare two arrays for equality.
equals(int[], int, int[], int, int) - Static method in class morfologik.util.Arrays
Compare two arrays for equality.
EXACT_MATCH - Static variable in class morfologik.fsa.MatchResult
The automaton has exactly one match for the input sequence.

F

FileUtils - Class in morfologik.util
Utility functions.
filler - Variable in class morfologik.fsa.FSA5
Filler character.
fillerByte - Variable in class morfologik.fsa.FSA5Serializer
 
finalStatesCount - Variable in class morfologik.fsa.FSAInfo
Number of final states (number of input sequences stored in the automaton).
FLAGS_OFFSET - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Offset of the flags field inside an arc.
FLAGS_SIZE - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Size of the flags field (constant for the builder).
FSA - Class in morfologik.fsa
This is a top abstract class for handling finite state automata.
FSA() - Constructor for class morfologik.fsa.FSA
 
fsa - Variable in class morfologik.stemming.Dictionary
FSA automaton with the compiled dictionary data.
FSA5 - Class in morfologik.fsa
FSA binary format implementation for version 5.
FSA5(InputStream) - Constructor for class morfologik.fsa.FSA5
Read and wrap a binary automaton in FSA version 5.
FSA5Serializer - Class in morfologik.fsa
Serializes in-memory FSA graphs to a binary format compatible with the FSA5 format of Jan Daciuk's fsa package.
FSA5Serializer() - Constructor for class morfologik.fsa.FSA5Serializer
 
FSABuilder - Class in morfologik.fsa
Fast, memory-conservative finite state automaton builder, returning a byte-serialized ConstantArcSizeFSA (a tradeoff between construction speed and memory consumption).
FSABuilder() - Constructor for class morfologik.fsa.FSABuilder
 
FSABuilder(int) - Constructor for class morfologik.fsa.FSABuilder
 
FSABuilder.InfoEntry - Enum in morfologik.fsa
Debug and information constants.
FSABuildTool - Class in morfologik.tools
Convert from plain text input to a serialized FSA in any of the available FSABuildTool.Formats.
FSABuildTool() - Constructor for class morfologik.tools.FSABuildTool
 
FSABuildTool.Format - Enum in morfologik.tools
The serialization format to use for the binary output.
FSADumpTool - Class in morfologik.tools
This utility will dump the information and contents of a given FSA dictionary.
FSADumpTool() - Constructor for class morfologik.tools.FSADumpTool
 
FSAFinalStatesIterator - Class in morfologik.fsa
An iterator that traverses the right language of a given node (all sequences reachable from a given node).
FSAFinalStatesIterator(FSA, int) - Constructor for class morfologik.fsa.FSAFinalStatesIterator
Create an instance of the iterator for a given node.
FSAFlags - Enum in morfologik.fsa
FSA automaton flags.
FSAInfo - Class in morfologik.fsa
Compute additional information about an FSA: number of arcs, nodes, etc.
FSAInfo(FSA) - Constructor for class morfologik.fsa.FSAInfo
 
FSAInfo(int, int, int, int) - Constructor for class morfologik.fsa.FSAInfo
 
FSASerializer - Interface in morfologik.fsa
All FSA serializers to binary formats will implement this interface.
FSATraversal - Class in morfologik.fsa
This class implements some common matching and scanning operations on a generic FSA.
FSATraversal(FSA) - Constructor for class morfologik.fsa.FSATraversal
Traversals of the given FSA.
FSAUtils - Class in morfologik.fsa
Other FSA-related utilities not directly associated with the class hierarchy.
FSAUtils() - Constructor for class morfologik.fsa.FSAUtils
 
FSAUtils.IntIntHolder - Class in morfologik.fsa
 
FSAUtils.IntIntHolder(int, int) - Constructor for class morfologik.fsa.FSAUtils.IntIntHolder
 
FSAUtils.IntIntHolder() - Constructor for class morfologik.fsa.FSAUtils.IntIntHolder
 

G

getArc(int, byte) - Method in class morfologik.fsa.CFSA
getArc(int, byte) - Method in class morfologik.fsa.CFSA2
getArc(int, byte) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getArc(int, byte) - Method in class morfologik.fsa.FSA
 
getArc(int, byte) - Method in class morfologik.fsa.FSA5
getArcCount(int) - Method in class morfologik.fsa.FSA
Calculates the number of arcs of a given node.
getArcLabel(int) - Method in class morfologik.fsa.CFSA
Return the label associated with a given arc.
getArcLabel(int) - Method in class morfologik.fsa.CFSA2
Return the label associated with a given arc.
getArcLabel(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getArcLabel(int) - Method in class morfologik.fsa.FSA
Return the label associated with a given arc.
getArcLabel(int) - Method in class morfologik.fsa.FSA5
Return the label associated with a given arc.
getDictionary() - Method in class morfologik.stemming.DictionaryLookup
 
getEndNode(int) - Method in class morfologik.fsa.CFSA
Return the end node pointed to by a given arc.
getEndNode(int) - Method in class morfologik.fsa.CFSA2
Return the end node pointed to by a given arc.
getEndNode(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getEndNode(int) - Method in class morfologik.fsa.FSA
Return the end node pointed to by a given arc.
getEndNode(int) - Method in class morfologik.fsa.FSA5
Return the end node pointed to by a given arc.
getExpectedFeaturesName(String) - Static method in class morfologik.stemming.Dictionary
Returns the expected name of the metadata file, based on the name of the FSA dictionary file.
getFirstArc(int) - Method in class morfologik.fsa.CFSA
getFirstArc(int) - Method in class morfologik.fsa.CFSA2
getFirstArc(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getFirstArc(int) - Method in class morfologik.fsa.FSA
 
getFirstArc(int) - Method in class morfologik.fsa.FSA5
getFlags() - Method in class morfologik.fsa.CFSA
Returns a set of flags for this FSA instance.
getFlags() - Method in class morfologik.fsa.CFSA2
Returns a set of flags for this FSA instance.
getFlags() - Method in class morfologik.fsa.CFSA2Serializer
Return supported flags.
getFlags() - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getFlags() - Method in class morfologik.fsa.FSA
Returns a set of flags for this FSA instance.
getFlags() - Method in class morfologik.fsa.FSA5
Returns a set of flags for this FSA instance.
getFlags() - Method in class morfologik.fsa.FSA5Serializer
Return supported flags.
getFlags() - Method in interface morfologik.fsa.FSASerializer
Returns the set of flags supported by the serializer (and the output automaton).
getForLanguage(String) - Static method in class morfologik.stemming.Dictionary
Return a built-in dictionary for a given ISO language code.
getInfo() - Method in class morfologik.fsa.FSABuilder
Return various statistics concerning the FSA and its compilation.
getNextArc(int) - Method in class morfologik.fsa.CFSA
getNextArc(int) - Method in class morfologik.fsa.CFSA2
getNextArc(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getNextArc(int) - Method in class morfologik.fsa.FSA
 
getNextArc(int) - Method in class morfologik.fsa.FSA5
getRightLanguageCount(int) - Method in class morfologik.fsa.CFSA
getRightLanguageCount(int) - Method in class morfologik.fsa.CFSA2
getRightLanguageCount(int) - Method in class morfologik.fsa.FSA
 
getRightLanguageCount(int) - Method in class morfologik.fsa.FSA5
Returns the number encoded at the given node.
getRootNode() - Method in class morfologik.fsa.CFSA
Returns the start node of this automaton.
getRootNode() - Method in class morfologik.fsa.CFSA2
getRootNode() - Method in class morfologik.fsa.ConstantArcSizeFSA
 
getRootNode() - Method in class morfologik.fsa.FSA
 
getRootNode() - Method in class morfologik.fsa.FSA5
Returns the start node of this automaton.
getSequences(int) - Method in class morfologik.fsa.FSA
Returns an iterator over all binary sequences starting at the given FSA state (node) and ending in final nodes.
getSequences() - Method in class morfologik.fsa.FSA
An alias of calling FSA.iterator() directly (FSA is also Iterable).
getSerializer() - Method in enum morfologik.tools.FSABuildTool.Format
 
getStem() - Method in class morfologik.stemming.WordData
 
getStemBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the stem's binary data (no charset decoding) to a custom byte buffer.
getTag() - Method in class morfologik.stemming.WordData
 
getTagBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the tag's binary data (no charset decoding) to a custom byte buffer.
getWord() - Method in class morfologik.stemming.WordData
 
getWordBytes(ByteBuffer) - Method in class morfologik.stemming.WordData
Copy the inflected word's binary data (no charset decoding) to a custom byte buffer.
go(CommandLine) - Method in class morfologik.tools.FSABuildTool
Command line entry point after parsing arguments.
go(CommandLine) - Method in class morfologik.tools.FSADumpTool
Command line entry point after parsing arguments.
gtl - Variable in class morfologik.fsa.CFSA
Number of bytes each address takes in full, expanded form (goto length).
gtl - Variable in class morfologik.fsa.FSA5
Number of bytes each address takes in full, expanded form (goto length).

H

hashCode() - Method in class morfologik.stemming.WordData
 
hasNext() - Method in class morfologik.fsa.FSAFinalStatesIterator
Returns true if there are still elements in this iterator.
hasNext() - Method in class morfologik.stemming.DictionaryIterator
 

I

IMessageLogger - Interface in morfologik.tools
 
index - Variable in class morfologik.fsa.MatchResult
Input sequence's index, interpretation depends on MatchResult.kind.
infixEncode(byte[], byte[], byte[]) - Method in class morfologik.tools.MorphEncoder
This method converts wordform, wordLemma and the tag to the form: inflected_form + MLKending + tags where '+' is a separator, M is the position of characters to be deleted towards the beginning of the inflected form ("A" means from the beginning, "B" from the second character, "C" from the third one, and so on), L is the number of characters to be deleted from the position specified by M ("A" means none, "B" means one, "C" means two, etc.), and K is a character that specifies how many characters should be deleted from the end of the inflected form to produce the lexeme by concatenating the stripped string with the ending ("A" means none, "B" means one, "C" means two, and so on).
infixEncodeUTF8(String, String, String) - Method in class morfologik.tools.MorphEncoder
A UTF-8 variant of MorphEncoder.infixEncode(byte[], byte[], byte[]).
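The MLKending coding described for infixEncode can be read back as follows. This is a minimal decoding sketch based only on the description in this index, not the library's implementation. The class InfixDecodeSketch and the method decodeLemma are hypothetical, and the sketch assumes the deleted infix and the deleted tail do not overlap.

```java
/** Hypothetical illustration; not part of morfologik. */
final class InfixDecodeSketch {
    /**
     * Rebuilds the lemma from an inflected form and its "MLKending" code.
     * Each control letter encodes a count: 'A' = 0, 'B' = 1, 'C' = 2, ...
     */
    static String decodeLemma(String inflected, String code) {
        final int m = code.charAt(0) - 'A'; // zero-based position of the infix
        final int l = code.charAt(1) - 'A'; // characters deleted at that position
        final int k = code.charAt(2) - 'A'; // characters deleted from the end
        final String ending = code.substring(3);

        // Cut out the infix, then strip the tail, then append the ending.
        final String withoutInfix =
            inflected.substring(0, m) + inflected.substring(m + l);
        final String stem =
            withoutInfix.substring(0, withoutInfix.length() - k);
        return stem + ending;
    }
}
```

For example, under these assumptions decodeLemma("aufgemacht", "DCBen") removes "ge" at position 3, strips the final "t" and appends "en". Setting M and L to "A" reduces the scheme to plain suffix replacement.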
inflectionFrames() - Method in class morfologik.tools.InflectionFramesTool
 
InflectionFramesTool - Class in morfologik.tools
Calculate inflection frames from the Polish dictionary.
InflectionFramesTool() - Constructor for class morfologik.tools.InflectionFramesTool
 
initializeOptions(Options) - Method in class morfologik.tools.FSABuildTool
 
initializeOptions(Options) - Method in class morfologik.tools.FSADumpTool
Command line options for the tool.
isArcFinal(int) - Method in class morfologik.fsa.CFSA
Returns true if the destination node at the end of this arc corresponds to an input sequence created when building this automaton.
isArcFinal(int) - Method in class morfologik.fsa.CFSA2
Returns true if the destination node at the end of this arc corresponds to an input sequence created when building this automaton.
isArcFinal(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
isArcFinal(int) - Method in class morfologik.fsa.FSA
Returns true if the destination node at the end of this arc corresponds to an input sequence created when building this automaton.
isArcFinal(int) - Method in class morfologik.fsa.FSA5
Returns true if the destination node at the end of this arc corresponds to an input sequence created when building this automaton.
isArcLast(int) - Method in class morfologik.fsa.CFSA
Returns true if this arc has LAST bit set.
isArcLast(int) - Method in class morfologik.fsa.CFSA2
Returns true if this arc has LAST bit set.
isArcLast(int) - Method in class morfologik.fsa.FSA5
Returns true if this arc has LAST bit set.
isArcTerminal(int) - Method in class morfologik.fsa.CFSA
Returns true if this arc does not have a terminating node (FSA.getEndNode(int) will throw an exception).
isArcTerminal(int) - Method in class morfologik.fsa.CFSA2
Returns true if this arc does not have a terminating node (FSA.getEndNode(int) will throw an exception).
isArcTerminal(int) - Method in class morfologik.fsa.ConstantArcSizeFSA
 
isArcTerminal(int) - Method in class morfologik.fsa.FSA
Returns true if this arc does not have a terminating node (FSA.getEndNode(int) will throw an exception).
isArcTerminal(int) - Method in class morfologik.fsa.FSA5
Returns true if this arc does not have a terminating node (FSA.getEndNode(int) will throw an exception).
isLabelCompressed(int) - Method in class morfologik.fsa.CFSA
Returns true if the label is compressed inside flags byte.
isNextSet(int) - Method in class morfologik.fsa.CFSA
 
isNextSet(int) - Method in class morfologik.fsa.CFSA2
 
isNextSet(int) - Method in class morfologik.fsa.FSA5
 
isSet(int, FSAFlags) - Static method in enum morfologik.fsa.FSAFlags
Returns true if the corresponding flag is set in the bit set.
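The flag entries in this index (bits, asShort, isSet) describe a bit-mask enum. A minimal sketch of that pattern follows. The enum Flag, its constants and their bit positions are hypothetical and chosen only for illustration; they are not the values used by FSAFlags.

```java
import java.util.Set;

/** Hypothetical illustration; not part of morfologik. */
final class FlagBitsSketch {
    /** Illustrative flags; names and bit positions are assumptions. */
    enum Flag {
        FIRST(1 << 0),
        SECOND(1 << 1),
        THIRD(1 << 8);

        /** Bit mask for the corresponding flag. */
        final int bits;

        Flag(int bits) {
            this.bits = bits;
        }
    }

    /** Encodes a set of flags in a single short by OR-ing their masks. */
    static short asShort(Set<Flag> flags) {
        short value = 0;
        for (Flag f : flags) {
            value |= f.bits;
        }
        return value;
    }

    /** Returns true if the given flag's mask is present in the bit set. */
    static boolean isSet(int flags, Flag flag) {
        return (flags & flag.bits) != 0;
    }
}
```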
IStemmer - Interface in morfologik.stemming
A generic "stemmer" interface in Morfologik.
iterator() - Method in class morfologik.fsa.FSA
Returns an iterator over all binary sequences starting from the initial FSA state (node) and ending in final nodes.
iterator() - Method in class morfologik.stemming.DictionaryLookup
Return an iterator over all WordData entries available in the embedded Dictionary.
iterator() - Method in class morfologik.stemming.PolishStemmer
Iterates over all dictionary forms stored in this stemmer.

K

kind - Variable in class morfologik.fsa.MatchResult
One of the match kind constants defined in this class.

L

LABEL_OFFSET - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Offset of the label field inside an arc.
LABEL_SIZE - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Size of the label field (constant for the builder).
labelMapping - Variable in class morfologik.fsa.CFSA
Label mapping for arcs of type (1) (see class documentation).
labelMapping - Variable in class morfologik.fsa.CFSA2
Label mapping for M-indexed labels.
Launcher - Class in morfologik.tools
A launcher for other command-line tools.
Launcher() - Constructor for class morfologik.tools.Launcher
 
LEXICAL_ORDERING - Static variable in class morfologik.fsa.FSABuilder
Comparator comparing full byte arrays consistently with FSABuilder.compare(byte[], int, int, byte[], int, int).
log(String) - Method in interface morfologik.tools.IMessageLogger
Log progress to the console.
log(String, Object) - Method in interface morfologik.tools.IMessageLogger
Log a two-part message.
log(String) - Method in class morfologik.tools.WriterMessageLogger
 
log(String, Object) - Method in class morfologik.tools.WriterMessageLogger
 
lookup(CharSequence) - Method in class morfologik.stemming.DictionaryLookup
Searches the automaton for a symbol sequence equal to word, followed by a separator.
lookup(CharSequence) - Method in interface morfologik.stemming.IStemmer
Returns a list of WordData entries for a given word.
lookup(CharSequence) - Method in class morfologik.stemming.PolishStemmer
Returns a list of WordData entries for a given word.

M

main(String[]) - Static method in class morfologik.tools.FSABuildTool
Command line entry point.
main(String[]) - Static method in class morfologik.tools.FSADumpTool
Command line entry point.
main(String[]) - Static method in class morfologik.tools.InflectionFramesTool
 
main(String[]) - Static method in class morfologik.tools.Launcher
Command line entry point.
match(MatchResult, byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
Same as FSATraversal.match(byte[], int, int, int), but allows passing a reusable MatchResult object so that no intermediate garbage is produced.
match(byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
Finds a matching path in the dictionary for a given sequence of labels from sequence and starting at node node.
match(byte[], int) - Method in class morfologik.fsa.FSATraversal
 
match(byte[]) - Method in class morfologik.fsa.FSATraversal
 
MatchResult - Class in morfologik.fsa
A matching result returned from FSATraversal.
MatchResult() - Constructor for class morfologik.fsa.MatchResult
 
metadata - Variable in class morfologik.stemming.Dictionary
Metadata associated with the dictionary.
metadata - Variable in class morfologik.stemming.DictionaryMetadata
Other meta data not included above.
METADATA_FILE_EXTENSION - Static variable in class morfologik.stemming.Dictionary
Expected metadata file extension.
morfologik.fsa - package morfologik.fsa
 
morfologik.stemming - package morfologik.stemming
 
morfologik.tools - package morfologik.tools
 
morfologik.util - package morfologik.util
 
MorphEncoder - Class in morfologik.tools
A class that converts tabular data to fsa morphological format.
MorphEncoder() - Constructor for class morfologik.tools.MorphEncoder
 
MorphEncoder(byte) - Constructor for class morfologik.tools.MorphEncoder
 

N

next() - Method in class morfologik.fsa.FSAFinalStatesIterator
 
next() - Method in class morfologik.stemming.DictionaryIterator
 
NO_MATCH - Static variable in class morfologik.fsa.MatchResult
The automaton has no match for the input sequence.
node - Variable in class morfologik.fsa.MatchResult
Automaton node, interpretation depends on the MatchResult.kind.
nodeCount - Variable in class morfologik.fsa.FSAInfo
Number of nodes in the automaton.
nodeDataLength - Variable in class morfologik.fsa.CFSA
The length of the node header structure (if the automaton was compiled with NUMBERS option).
nodeDataLength - Variable in class morfologik.fsa.FSA5
The length of the node header structure (if the automaton was compiled with NUMBERS option).

O

openInputStream(String) - Static method in class morfologik.util.ResourceUtils
Returns an input stream to the resource.

P

perfectHash(byte[], int, int, int) - Method in class morfologik.fsa.FSATraversal
Calculate perfect hash for a given input sequence of bytes.
perfectHash(byte[]) - Method in class morfologik.fsa.FSATraversal
 
PolishStemmer - Class in morfologik.stemming
A dictionary-based stemmer for the Polish language.
PolishStemmer() - Constructor for class morfologik.stemming.PolishStemmer
This constructor is initialized with a built-in dictionary or fails with a runtime exception if the dictionary is not available.
prefixEncode(byte[], byte[], byte[]) - Method in class morfologik.tools.MorphEncoder
This method converts wordform, wordLemma and the tag to the form: inflected_form + LKending + tags where '+' is a separator, L is the number of characters to be deleted from the beginning of the word ("A" means none, "B" means one, "C" means two, etc.), and K is a character that specifies how many characters should be deleted from the end of the inflected form to produce the lexeme by concatenating the stripped string with the ending ("A" means none, "B" means one, "C" means two, and so on).
prefixEncodeUTF8(String, String, String) - Method in class morfologik.tools.MorphEncoder
A UTF-8 variant of MorphEncoder.prefixEncode(byte[], byte[], byte[]). This method converts wordform, wordLemma and the tag to the form: inflected_form + LKending + tags where '+' is a separator, L is the number of characters to be deleted from the beginning of the word ("A" means none, "B" means one, "C" means two, etc.), and K is a character that specifies how many characters should be deleted from the end of the inflected form to produce the lexeme by concatenating the stripped string with the ending ("A" means none, "B" means one, "C" means two, and so on).
printUsage() - Method in class morfologik.tools.FSABuildTool
 

R

read(InputStream) - Static method in class morfologik.fsa.FSA
A factory for reading automata in any of the supported versions.
read(File) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary using the path to the FSA file and the expected metadata extension.
read(URL) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary using the URL to the FSA file and the expected metadata extension.
readAndClose(InputStream, InputStream) - Static method in class morfologik.stemming.Dictionary
Attempts to load a dictionary from opened streams of FSA dictionary data and associated metadata.
readByte(InputStream) - Static method in class morfologik.util.FileUtils
Read exactly one byte from the input stream.
readFully(InputStream) - Static method in class morfologik.util.FileUtils
Reads all bytes from an input stream (until EOF).
readFully(InputStream, byte[]) - Static method in class morfologik.util.FileUtils
Read enough bytes to fill the array. If there are not enough bytes, throw an exception.
readInt(InputStream) - Static method in class morfologik.util.FileUtils
Read exactly 4 bytes from the input stream.
readShort(InputStream) - Static method in class morfologik.util.FileUtils
Read exactly 2 bytes from the input stream.
referenceEquals(Object[], int, Object[], int, int) - Static method in class morfologik.util.Arrays
Compare two lists of objects for reference-equality.
remove() - Method in class morfologik.fsa.FSAFinalStatesIterator
Not implemented in this iterator.
remove() - Method in class morfologik.stemming.DictionaryIterator
 
ResourceUtils - Class in morfologik.util
Resource management utilities.
restartFrom(int) - Method in class morfologik.fsa.FSAFinalStatesIterator
Restart walking from node.
rightLanguage(FSA, int) - Static method in class morfologik.fsa.FSAUtils
All byte sequences generated as the right language of state.
rightLanguageForAllStates(FSA) - Static method in class morfologik.fsa.FSAUtils
Calculate the size of right language for each state in an FSA.

S

separator - Variable in class morfologik.stemming.DictionaryMetadata
A separator character between fields (stem, lemma, form).
SEQUENCE_IS_A_PREFIX - Static variable in class morfologik.fsa.MatchResult
The sequence is a prefix of at least one sequence in the automaton.
serialize(FSA, T) - Method in class morfologik.fsa.CFSA2Serializer
Serializes any FSA to CFSA2 stream.
serialize(FSA, T) - Method in class morfologik.fsa.FSA5Serializer
Serialize root state s to an output stream in FSA5 format.
serialize(FSA, T) - Method in interface morfologik.fsa.FSASerializer
Serialize a finite state automaton to an output stream.
size - Variable in class morfologik.fsa.FSAInfo
Arcs size (in serialized form).
standardEncode(byte[], byte[], byte[]) - Method in class morfologik.tools.MorphEncoder
This method converts the wordForm, wordLemma and tag to the form: wordForm + Kending + tags where '+' is a separator, K is a character that specifies how many characters should be deleted from the end of the inflected form to produce the lexeme by concatenating the stripped string with the ending.
standardEncodeUTF8(String, String, String) - Method in class morfologik.tools.MorphEncoder
A UTF-8 variant of MorphEncoder.standardEncode(byte[], byte[], byte[]). This method converts the wordForm, wordLemma and tag to the form: wordForm + Kending + tags where '+' is a separator, K is a character that specifies how many characters should be deleted from the end of the inflected form to produce the lexeme by concatenating the stripped string with the ending.
startPart(String) - Method in interface morfologik.tools.IMessageLogger
Log message header and save current time.
startPart(String) - Method in class morfologik.tools.WriterMessageLogger
 
StateVisitor - Interface in morfologik.fsa
State visitor.
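
The wordForm + Kending + tags layout described for standardEncode above can be illustrated with a small standalone sketch. It does not call MorphEncoder; the class StandardEncodeSketch and its methods are hypothetical, and the mapping of K to 'A' plus the number of deleted characters is an assumption of this sketch rather than something the index entry states.

```java
import java.util.regex.Pattern;

/** Standalone illustration of the "wordForm + Kending + tags" layout; not the MorphEncoder class. */
final class StandardEncodeSketch {
    /**
     * Encodes an inflected form against its lemma.
     * Assumption: K is 'A' plus the number of trailing characters to delete.
     */
    static String encode(String wordForm, String lemma, String tag, char separator) {
        // Length of the prefix shared by the inflected form and the lemma.
        int prefix = 0;
        int max = Math.min(wordForm.length(), lemma.length());
        while (prefix < max && wordForm.charAt(prefix) == lemma.charAt(prefix)) {
            prefix++;
        }
        int toDelete = wordForm.length() - prefix;
        char k = (char) ('A' + toDelete);
        String ending = lemma.substring(prefix);
        return wordForm + separator + k + ending + separator + tag;
    }

    /** Recovers the lemma: strip K-many characters from the form, then append the ending. */
    static String decodeLemma(String encoded, char separator) {
        String[] parts = encoded.split(Pattern.quote(String.valueOf(separator)), -1);
        String form = parts[0];
        int toDelete = parts[1].charAt(0) - 'A';
        String ending = parts[1].substring(1);
        return form.substring(0, form.length() - toDelete) + ending;
    }

    public static void main(String[] args) {
        String encoded = encode("mice", "mouse", "N", '+');
        System.out.println(encoded);                   // mice+Douse+N
        System.out.println(decodeLemma(encoded, '+')); // mouse
    }
}
```

For "mice" and "mouse" the shared prefix is "m", so three characters are deleted (K = 'D' under the assumed mapping) and the ending "ouse" is appended. Storing only the deletion count and the ending keeps entries with a common stem compact.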

T

TARGET_ADDRESS_SIZE - Static variable in class morfologik.fsa.ConstantArcSizeFSA
Size of the target address field (constant for the builder).
toDot(FSA, int) - Static method in class morfologik.fsa.FSAUtils
Returns the right-language reachable from a given FSA node, formatted as an input for the graphviz package (expressed in the dot language).
toDot(Writer, FSA, int) - Static method in class morfologik.fsa.FSAUtils
Saves the right-language reachable from a given FSA node, formatted as an input for the graphviz package (expressed in the dot language), to the given writer.
toString() - Method in enum morfologik.fsa.FSABuilder.InfoEntry
 
toString() - Method in class morfologik.fsa.FSAInfo
 
toString(byte[], int, int) - Static method in class morfologik.util.Arrays
Convert a range of a byte array to its string representation.
toString(ByteBuffer) - Static method in class morfologik.util.BufferUtils
Convert a byte buffer to a string in platform default encoding.

U

usesInfixes - Variable in class morfologik.stemming.DictionaryMetadata
True if the dictionary was compiled with infix compression.
usesPrefixes - Variable in class morfologik.stemming.DictionaryMetadata
True if the dictionary was compiled with prefix compression.

V

valueOf(String) - Static method in enum morfologik.fsa.FSABuilder.InfoEntry
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum morfologik.fsa.FSAFlags
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum morfologik.tools.FSABuildTool.Format
Returns the enum constant of this type with the specified name.
values() - Static method in enum morfologik.fsa.FSABuilder.InfoEntry
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum morfologik.fsa.FSAFlags
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum morfologik.tools.FSABuildTool.Format
Returns an array containing the constants of this enum type, in the order they are declared.
VERSION - Static variable in class morfologik.fsa.CFSA
Automaton header version value.
VERSION - Static variable in class morfologik.fsa.CFSA2
Automaton header version value.
VERSION - Static variable in class morfologik.fsa.FSA5
Automaton version as in the file header.
visitAllStates(T) - Method in class morfologik.fsa.FSA
Visit all states.
visitInPostOrder(T) - Method in class morfologik.fsa.FSA
Same as FSA.visitInPostOrder(StateVisitor, int), starting from root automaton node.
visitInPostOrder(T, int) - Method in class morfologik.fsa.FSA
Visits all states reachable from node in postorder.
visitInPreOrder(T) - Method in class morfologik.fsa.FSA
Same as FSA.visitInPreOrder(StateVisitor, int), starting from root automaton node.
visitInPreOrder(T, int) - Method in class morfologik.fsa.FSA
Visits all states reachable from node in preorder.
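
The difference between the preorder and postorder visits listed above can be shown on a toy graph. This is a sketch under stated assumptions, not the FSA class: TraversalSketch, its Visitor interface, and the rule that a false return from accept stops the walk are all hypothetical; only the accept(int) shape is taken from the StateVisitor entry in this index.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Toy state graph for illustration only; this is not the morfologik FSA class. */
final class TraversalSketch {
    /** Hypothetical visitor; returning false stops the walk (an assumption of this sketch). */
    interface Visitor {
        boolean accept(int state);
    }

    private final Map<Integer, int[]> targets = new HashMap<>();

    void addState(int state, int... next) {
        targets.put(state, next);
    }

    void visitInPreOrder(Visitor v, int node) {
        preOrder(v, node, new HashSet<Integer>());
    }

    void visitInPostOrder(Visitor v, int node) {
        postOrder(v, node, new HashSet<Integer>());
    }

    private boolean preOrder(Visitor v, int state, Set<Integer> seen) {
        if (!seen.add(state)) {
            return true; // shared state, already visited
        }
        if (!v.accept(state)) {
            return false; // visit the state before its successors
        }
        for (int next : targets.getOrDefault(state, new int[0])) {
            if (!preOrder(v, next, seen)) {
                return false;
            }
        }
        return true;
    }

    private boolean postOrder(Visitor v, int state, Set<Integer> seen) {
        if (!seen.add(state)) {
            return true; // shared state, already visited
        }
        for (int next : targets.getOrDefault(state, new int[0])) {
            if (!postOrder(v, next, seen)) {
                return false;
            }
        }
        return v.accept(state); // visit the state after its successors
    }

    /** Diamond-shaped graph: 0 -> {1, 2}, 1 -> 3, 2 -> 3. */
    static TraversalSketch example() {
        TraversalSketch g = new TraversalSketch();
        g.addState(0, 1, 2);
        g.addState(1, 3);
        g.addState(2, 3);
        g.addState(3);
        return g;
    }

    static List<Integer> order(TraversalSketch g, int node, boolean pre) {
        final List<Integer> visited = new ArrayList<>();
        Visitor collect = state -> visited.add(state);
        if (pre) {
            g.visitInPreOrder(collect, node);
        } else {
            g.visitInPostOrder(collect, node);
        }
        return visited;
    }

    public static void main(String[] args) {
        TraversalSketch g = example();
        System.out.println("preorder:  " + order(g, 0, true));  // [0, 1, 3, 2]
        System.out.println("postorder: " + order(g, 0, false)); // [3, 1, 2, 0]
    }
}
```

State 3 is reachable along two paths but is reported once in each walk, because the set of seen states prevents revisiting shared states.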

W

withAnnotationSeparator(byte) - Method in class morfologik.fsa.CFSA2Serializer
 
withAnnotationSeparator(byte) - Method in class morfologik.fsa.FSA5Serializer
Supports built-in annotation separator.
withAnnotationSeparator(byte) - Method in interface morfologik.fsa.FSASerializer
Supports built-in annotation separator.
withFiller(byte) - Method in class morfologik.fsa.CFSA2Serializer
 
withFiller(byte) - Method in class morfologik.fsa.FSA5Serializer
Supports built-in filler separator.
withFiller(byte) - Method in interface morfologik.fsa.FSASerializer
Supports built-in filler separator.
withLogger(IMessageLogger) - Method in class morfologik.fsa.CFSA2Serializer
 
withLogger(IMessageLogger) - Method in class morfologik.fsa.FSA5Serializer
Log extra messages during construction.
withLogger(IMessageLogger) - Method in interface morfologik.fsa.FSASerializer
Log extra messages during construction.
withNumbers() - Method in class morfologik.fsa.CFSA2Serializer
Serialize the automaton with the number of right-language sequences in each node.
withNumbers() - Method in class morfologik.fsa.FSA5Serializer
Serialize the automaton with the number of right-language sequences in each node.
withNumbers() - Method in interface morfologik.fsa.FSASerializer
Supports built-in right language count on nodes, speeding up perfect hash counts.
WordData - Class in morfologik.stemming
Stem and tag data associated with a given word.
writeInt(OutputStream, int) - Static method in class morfologik.util.FileUtils
 
WriterMessageLogger - Class in morfologik.tools
A logger dumping info to System.err.
WriterMessageLogger(PrintWriter) - Constructor for class morfologik.tools.WriterMessageLogger
 
writeShort(OutputStream, short) - Static method in class morfologik.util.FileUtils
 
