Fix hordeconfig max context setting, and add Makefile flags for cuda F16/KQuants per iter. (#252)

* Fix hordeconfig maxcontext setting.

* cuda: Bring DMMV_F16 and KQUANTS_ITER Makefile flags over from llama.
This commit is contained in:
Ycros 2023-06-22 01:01:46 +10:00 committed by GitHub
parent dfdd20240c
commit b1f00fa9cc
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
2 changed files with 9 additions and 1 deletions

View file

@ -724,7 +724,7 @@ def main(args):
sys.exit(2)
if args.hordeconfig and args.hordeconfig[0]!="":
global friendlymodelname, maxhordelen, showdebug
global friendlymodelname, maxhordelen, maxhordectx, showdebug
friendlymodelname = "koboldcpp/"+args.hordeconfig[0]
if len(args.hordeconfig) > 1:
maxhordelen = int(args.hordeconfig[1])