U AT^@sldZdZddlZddlZddlmZddlmZddlm Z Gdd d e Z Gd d d e Z ia d d ZdS)aA bottom-up tree matching algorithm implementation meant to speed up 2to3's matching process. After the tree patterns are reduced to their rarest linear path, a linear Aho-Corasick automaton is created. The linear automaton traverses the linear paths from the leaves to the root of the AST and returns a set of nodes for further matching. This reduces significantly the number of candidate nodes.z+George Boutsioukis N) defaultdict)pytree) reduce_treec@s eZdZdZeZddZdS)BMNodez?Class for a node of the Aho-Corasick automaton used in matchingcCs"i|_g|_ttj|_d|_dS)N)transition_tablefixersnextrcountidcontentselfr+/usr/lib64/python3.8/lib2to3/btm_matcher.py__init__s zBMNode.__init__N)__name__ __module__ __qualname____doc__ itertoolsr rrrrrrsrc@s8eZdZdZddZddZddZdd Zd d Zd S) BottomMatcherzgThe main matcher class. After instantiating the patterns should be added using the add_fixer methodcCs0t|_t|_|jg|_g|_td|_dS)NZRefactoringTool) setmatchrrootZnodesr logging getLoggerloggerrrrrrs  zBottomMatcher.__init__cCsH|j|t|j}|}|j||jd}|D]}|j|q2dS)zReduces a fixer's pattern tree to a linear path and adds it to the matcher(a common Aho-Corasick automaton). The fixer is appended on the matching states and called when they are reachedstartN)r appendrZ pattern_treeZget_linear_subpatternaddr)rfixerZtreeZlinear match_nodesZ match_noderrr add_fixer%s   zBottomMatcher.add_fixerc Cs|s |gSt|dtr`g}|dD]6}|j||d}|D]}|||dd|q:q$|S|d|jkrt}||j|d<n|j|d}|ddr|j|dd|d}n|g}|SdS)z5Recursively adds a linear pattern to the AC automatonrrrN) isinstancetupler"extendrr)rpatternr r$Z alternativeZ end_nodesendZ next_noderrrr"1s"  zBottomMatcher.addc Cs|j}tt}|D]}|}|rd|_|jD]$}t|tjr*|jdkr*d|_qPq*|j dkrb|j}n|j }||j kr|j |}|j D]}|| |qnH|j}|j dk r|j jrq||j kr|j |}|j D]}|| |q|j }qq|S)auThe main interface with the bottom matcher. The tree is traversed from the bottom using the constructed automaton. Nodes are only checked once as the tree is retraversed. When the automaton fails, we give it one more shot(in case the above tree matches as a whole with the rejected leaf), then we break for the next leaf. There is the special case of multiple arguments(see code comments) where we recheck the nodes Args: The leaves of the AST tree to be matched Returns: A dictionary of node matches with fixers as the keys T;FrN)rrlistZ was_checkedZchildrenr&rZLeafvaluetyperr r!parent) rZleavesZcurrent_ac_noderesultsZleafZcurrent_ast_nodeZchildZ node_tokenr#rrrrunSs8          zBottomMatcher.runcs*tdfdd|jtddS)z %d [label=%s] //%sr)rkeysprintr type_reprstrr r )ZnodeZ subnode_keyZsubnode print_noderrr7s  z*BottomMatcher.print_ac..print_node}N)r3rrrr6rprint_acs  zBottomMatcher.print_acN) rrrrrr%r"r1r9rrrrrs  "8rcCsDts8ddlm}|jD]\}}t|tkr|t|<qt||S)Nr)python_symbols) _type_reprsZpygramr:__dict__itemsr.int setdefault)Ztype_numr:namevalrrrr4s   r4)r __author__rr collectionsrrrZ btm_utilsrobjectrrr;r4rrrrs